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5 TRANSCYTOSIS OF ADENO-ASSOCIATED VIRUSES 

FIELD OF THE INVENTION 

This invention relates generally to the ability of AAV vectors to transcytose 
10 epithelial barriers. 

BACKGROUND OF THE INVENTION 

The adeno-associated viruses (AAV) were originally classified according to size, 

15 structure, and dependence upon a helper virus for replication. AAV is a member of the 
Parvoviridae, a virus family characterized by a single stranded linear DNA genome and a 
small icosahedral shaped capsid measuring about 20nm in diameter. AAV was first 
described as a contaminant of tissue culture grown simian virus 1 5, a simian adeno virus 
and was found dependent on adenovirus for measurable replication. This led to its name, 

20 adeno-associated virus, and its classification in the genus Dependovirus. Because the 
majority of AAV isolates were first identified as contaminants of laboratory stocks of 
adenovirus, little is known about their natural tissue tropism. However in vivo experiments 
suggest they are effective vectors for gene transfer applications. Currently eleven full-length 
isolates have been cloned and their initial characterization indicates that each serotype has 

25 unique binding/cell tropism characteristics. 

Transcytosis is the transport of macromolecular cargo from one side of a cell to the 
other within membrane-bounded carrier(s). It is a strategy used by multicellular organisms 
to selectively move material between two different environments while maintaining the 
distinct compositions of those environments. The ability of a pathogen to spread through a 

30 tissue is a critical determinate of its virulence. The process of transcytosis has been reported 
for-a-number of viruses. For example, HIV and poliovirus cross simple epithelial cells 
w.Ute^t infection and are still infectious when they cross into the submucosa. Likewise, the 
Epstein-Barr virus (EBV) forms a complex with mucosal immunoglobulins (IgA) that are 
specific for gp350, a viral surface protein that is present in latently infected people. This 

35 complex binds to the poly-immunoglobulin receptor at the basal surface of epithelial cells, 

1 
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5 and is endocytosed and delivered apically without infection. To date, there is no report of 
transcytosis by any AAV. 

Provided herein are methods for transcytosis across barrier epithelial cells using 
AAV vectors. The ability of a non-pathogenic vector to transcytose barrier epithelial cells 
can be used to deliver genes to sub-epithelial targets. One important example includes the 
10 delivery of genes across the blood-brain-barrier without the need for direct injection into the 
brain. Furthermore, herein is described a method for re-directing virus that enters a cell by 
transcytosis to result in transduction of the cell by blocking exocytosis. 

SUMMARY OF THE INVENTION 

15 

In accordance with the purpose(s) of this invention, as embodied and broadly 
described herein, this invention, in one aspect, relates to a method of delivering a 
heterologous nucleic acid across an epithelial barrier comprising delivering to the epithelial 
barrier an AAV vector comprising the heterologous nucleic acid. The epithelial cells can be 
20 in the gut, lung, genitourinary tract, kidney, blood vessels or brain. 

In another aspect, the invention relates to a method of transcytosing epithelial cells 
of a human subject comprising administering to the subject a viral vector comprising a 
heterologous nucleic acid, wherein the viral vector is selected from a group consisting of 
BAAV, AAV4 or AAV5. 
25 In yet another aspect, the invention relates to a method of delivering a heterologous 

nucleic acid across human airway epithelial cells, comprising delivering to the cells a 
BAAV vector comprising the nucleic acid. 

In yet another aspect, the invention relates to a method of delivering a heterologous 
nucleic acid across human cerebral microvascular endothelial cells, comprising delivering to 
30 the cells a BAAV vector comprising the nucleic acid. 

In yet another aspecMheonvention relates to~a method of delivering a heterologous 
nucleic acid across human e^gmetri^gggjjgggj^ cells comprising delivering to the cells a 
BAAV vector comprising the nucleic acid. 

In yet another aspectrthe invention relates to a method of delivering a heterologous 
35 nucleic acid across human kidney epithelial cells, comprising delivering to the cells a 
BAAV vector comprising the nucleic acid. 

2 
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5 In yet another aspect, the invention relates to a method of delivering a heterologous 

nucleic acid across human absorptive enterocytes, comprising delivering to the cells an 
AAV5 vector comprising the nucleic acid. 

In yet another aspect, the invention relates to a method of delivering a heterologous 
nucleic acid across human airway epithelial cells, comprising delivering to the cells an 
1 0 AAV4 vector comprising the nucleic acid. 

In yet another aspect, the invention relates to a method of delivering a heterologous 
nucleic acid across human cerebral microvascular endothelial cells, comprising delivering to 
the cells a AAV4 vector comprising the nucleic acid. 

In yet another aspect, the invention relates to a method of delivering a heterologous 
1 5 nucleic acid across human endometrial epithelial cells, comprising delivering to the cells an 
AA V4 vector comprising the nucleic acid. 

In yet another aspect, the invention relates to a method of delivering a heterologous 
nucleic acid across human kidney epithelial cells, comprising delivering to the cells an 
AAV4 vector comprising the nucleic acid. \ 
20 In yet another aspect, the invention relates to a method of delivering a heterologous 

nucleic acid across human absorptive enterocytes comprising delivering to the cells an 
AAV4 vector comprising the nucleic acid. 

In yet another aspect, the invention relates to a method of delivering a heterologous 
nucleic acid across an epithelial barrier of the lung, comprising delivering to the lung a 
25 BAAV vector comprising the nucleic acid. 

In yet another aspect, the invention relates to a method of delivering a heterologous 
nucleic acid across an epithelial barrier in the brain, comprising delivering to the brain a 
BAAV vector comprising the nucleic acid. 

In yet another aspect, the invention relates to a method of delivering a heterologous 
30 nucleic acid across the epithelial barrier of blood vessels into the muscle, comprising 
delivering to the blood stream a BAAV vector comprising the nucleic acid. 

In yet another aspect, the invention relates to a method of delivering a heterologous 
nucleic acid across an epithelial barrier in the genitourinary tract, comprising delivering to 
the genitourinary tract a BAAV vector comprising the nucleic acid genitourinary tract. 
35 Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 

barrier in the kidney, comprising delivering to the genitourinary tract a BAAV vector 

3 
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5 comprising the nucleic acid. In one aspect of the method, the epithelial barrier comprises 
human renal collecting ducts or proximal tubules. 

In yet another aspect, the invention relates to a method of transcytosing lung 
epithelial cells of a subject comprising contacting the lung epithelial cells of the subject with 
a BAAV vector comprising a heterologous nucleic acid. 
10 In yet another aspect, the invention relates to a method of transcytosing CNS 

epithelial cells of a subject comprising contacting the CNS epithelial cells of the subject 
with a BAAV vector comprising a heterologous nucleic acid. 

In yet another aspect, the invention relates to a method of transcytosing vascular 
epithelial cells of a subject comprising contacting the vascular epithelial cells of the subject 
1 5 with a BAAV vector comprising a heterologous nucleic acid. 

In yet another aspect, the invention relates to a method of transcytosing genitourinary 
tract epithelial cells of a subject comprising contacting the genitourinary tract epithelial cells 
of the subject with a BAAV vector comprising a heterologous nucleic acid. 

In yet another aspect, the invention relates to a method of transcytosing kidney 
20 epithelial cells of a subject comprising contacting the kidney epithelial cells of the subject 
with a BAAV vector comprising a heterologous nucleic acid. 

In yet another aspect, the invention relates to a method of delivering a heterologous 
nucleic acid across an epithelial barrier in the gut, comprising delivering to the gut an AAV5 
vector comprising the nucleic acid. 
25 In yet another aspect, the invention relates to a method of transcytosing gut epithelial 

cells of a subject comprising contacting the gut epithelial cells of the subject with an AAV5 
vector comprising a heterologous nucleic acid. 

In yet another aspect, the invention relates to a method of delivering a heterologous 
nucleic acid across an epithelial barrier in the gut, comprising delivering to the gut an AAV4 
30 vector comprising the nucleic acid. 

In yet another aspect, the invention relates4o a method of delivering a heterologous 
nucleic acid across an epithelial barrier i*» *he Umg, comprising delivering to the lung an 
AAV4 vector comprising the nucleic acid. 

In yet another aspect, the invention relates to a method of delivering a heterologous 
35 nucleic acid across an epithelial barrier in the CNS, comprising delivering to the CNS an 
AAV4 vector comprising the nucleic acid. 

4 
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5 In yet another aspect, the invention relates to a method of delivering a heterologous 

nucleic acid across the epithelial barrier of blood vessels into the muscle, comprising 
delivering to the blood stream an AAV4 vector comprising the nucleic acid. 

In yet another aspect, the invention relates to a method of delivering a heterologous 
nucleic acid across an epithelial barrier in the genitourinary tract, comprising delivering to 
1 0 the genitourinary tract an AAV4 vector comprising the nucleic acid. 

In yet another aspect, the invention relates to a method of delivering a heterologous 
nucleic acid across an epithelial barrier in the kidneys, comprising delivering to the kidneys 
an AAV4 vector comprising the nucleic acid. 

In yet another aspect, the invention relates to a method of transcytosing lung 
15 epithelial cells of a subject comprising contacting the lung epithelial cells of the subject with 
an AAV4 vector comprising a heterologous nucleic acid. 

In yet another aspect, the invention relates to a method of transcytosing CNS 
epithelial cells of a subject comprising contacting the CNS epithelial cells of the subject 
with an AAV4 vector comprising a heterologous nucleic acid. 
20 In yet another aspect, the invention relates to a method of transcytosing vascular 

epithelial cells of a subject comprising contacting the vascular epithelial cells of the subject 
with an AAV4 vector comprising a heterologous nucleic acid. 

In yet another aspect, the invention relates to a method of transcytosing genitourinary 
tract epithelial cells of a subject comprising contacting the genitourinary epithelial cells of 
25 the subject with an AAV4 vector comprising a heterologous nucleic acid. 

In yet another aspect, the invention relates to a method of transcytosing kidney 
epithelial cells of a subject comprising contacting the kidney epithelial cells of the subject 
with an AAV4 vector comprising a heterologous nucleic acid. 

In yet another aspect, the invention relates to a method of transcytosing gut epithelial 
30 cells of a subject comprising contacting the gut epithelial cells of the subject with an AAV4 
vector comprising a heterologous nucleic acid. 

Additional advantages of the invention will be set forth in part in the description 
which follows, and in part will be obvious from the description, or may be learned by 
practice of the invention. The advantages of the invention will be realized and attained by 
35 means of the elements and combinations particularly pointed out in the appended claims. It 
is to be understood that both the foregoing general description and the following detailed 

5 
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5 description are exemplary and explanatory only and are not restrictive of the invention, as 
claimed. 

BRIEF DESCRIPTION OF THE DRAWINGS 

1 0 The accompanying drawings, which are incorporated in and constitute a part of this 

specification, illustrate (one) several embodiment(s) of the invention and together with the 

description, serve to explain the principles of the invention. 

Figure 1 shows that AAV4 transcytosed in CaCo-2, MDCKI, MDCKII, Human 

primary immortalized epithelial endometrial, Bovine brain primary endothelia cells (BBB). 
1 5 AAV5 transcytosed CaCo-2 cells, whereas BAAV transcytosed in MDCKs, Endometrial, 

airways epithelia, and BBB. AAV6 did not transcytose in any of cell types tested. Hela cells 

do not form barrier epithelia and were used as a control. 

Figure 2 shows that the treatment of the basal lateral surface of Human primary 

airways epithelial cell (HAE) with tannic acid blocked the transcytosis of BAAV vector 
20 containing a GFP expression cassette from the apical surface to the basal lateral. 

Furthermore transduction dramatically increased when assayed at 24 hrs post inoculation. 

In contrast no change was observed in AAV2 transduction, which did not demonstrate any 

transcytosis activity and has limited binding activity on HAE. 

25 DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention may be understood more readily by reference to the following 
detailed description of the invention and the Examples included therein and to the Figures 
and their previous and following description. 

Before the present compounds, compositions, articles, devices, and/or methods are 
30 disclosed and described, it is to be understood that this invention is not limited to specific 
synthetic methods, specific cell types, or to particular tissues, as such may, of course, vary. 
It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only and is not intended to be limiting. 

As used in the specification and the appended claims, the singular forms "a," "an" 
35 and "the" include plural referents unless the context clearly dictates otherwise. Thus, for 

6 
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10 



15 



20 



25 



example, reference to "a pharmaceutical carrier" includes mixtures of two or more such 
carriers, and the like. 

Ranges may be expressed herein as from "about" one particular value, and/or to 
"about" another particular value. When such a range is expressed, another embodiment 
includes from the one particular value and/or to the other particular value. Similarly, when 
values are expressed as approximations, by use of the antecedent "about," it will be 
understood that the particular value forms another embodiment. It will be further understood 
that the endpoints of each of the ranges are significant both in relation to the other endpoint, 
and independently of the other endpoint. 

"Optional" or "optionally" as used herein means that the subsequently described 
event or circumstance may or may not occur, and that the description includes instances 
where said event or circumstance occurs and instances where it does not. 

AAV TRANSCYTOSIS 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier comprising delivering to the epithelial barrier an AAV vector comprising the 
heterologous nucleic acid. In one aspect of the method, the AAV is AAV4, AAV5, or 
BAAV. In another aspect of the method, the epithelial cells are in the gut, lung, 
genitourinary tract, kidney, blood vessels or brain. In another aspect of the method, the 
epithelial cells can be selected from a group consisting of bronchial, alveolar, tracheal or 
upper airway epithelial cells; absorptive enterocytes or M cells; endometrial or urinary 
epithelial cells; renal collecting duct or proximal tubule epithelial cells; cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells. 

Further disclosed is a method of transcytosing epithelial cells of a human subject 
comprising administering to the subject an AAV vector comprising a heterologous nucleic 
acid. In one aspect of the method, the vector is AAV4, AAV5, or BAAV. In another aspect 
of the method, the epithelial cells are selected from a group consisting of bronchial, 
alveolar, tracheal or upper airway epithelial cells; absorptive enterocytes or M cells; 
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5 Further contemplated are methods for the delivery of molecules across epithelial cell 

barriers comprising coupling the molecules to non-recombinant (wild-type) AAV capsids or 
particles. In one aspect, the molecules are radioligands or enzymes. 

The term "adeno-associated virus (AAV)" is used herein to refer to a genus of 
viruses in the family Parvoviridae which are all defective viruses (unable to replicate by 

10 themselves) and depend on the co-infection of their host cell by other, nondefective viruses 
to help them replicate. 

The term "transcytosis" is used herein to mean the transport of macromolecular 
cargo from one side of a cell to the other within a membrane-bounded carrier(s). Tuma and 
Hubbard provided a review of transcytosis (Tuma PL and Hubbard AL. 2003. Physiol Rev. 

15 83:871-932), herein incorporated by reference for its teaching regarding the nature and uses 
for trancytosis. Transcytosis is a strategy used by multicellular organisms to selectively 
move material between two different environments while maintaining the distinct 
compositions of those environments. N. Simionescu was the first to coin the term 
transcytosis to describe the vectorial transfer of macromolecular cargo within the 

20 plasmalemmal vesicles from the circulation across capillary endothelial cells to the 

interstitium of tissues. During this same period, another type of transcytosis was being 
discovered. Immunologists comparing the different types of immunoglobulins found in 
various secretions (e.g., serum, milk, saliva, and the intestinal lumen) speculated that the 
form of IgA found in external secretions (called secretory IgA, due to the presence of an 

25 additional protein component) was selectively transported across the epithelial cell barrier. 
More is known about transcytosis as it is expressed in epithelial tissues, which form cellular 
barriers between two environments. In this polarized cell type, net movement of material 
can be in either direction, apical to basolateral or the reverse, depending on the cargo and 
particular cellular context of the process. However, transcytosis is not restricted to only 

30 epithelial cells. 

Since the 19th centuiy_dye experiments-a£Ehrlich, the brain has been known as a 
"privileged" organ where * c fr j"toWiii«jtfAp tfaH g ™ that the environment remains 

chemically stable. The two principal gatekeepers of the brain are the cerebral capillary 
endothelium and the cuboidai epithelial ceiis of the choroid plexus. These cellular barriers 

35 are specialized for the passage of different nutrients from the blood. The capillaries move 
nutrients that are required rapidly and in large quantities, such as glucose and amino acids. 

8 
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5 These small molecules are transported by membrane carriers using facilitated diffusion. The 
choroid plexus supplies nutrients that are required less acutely and in lower quantities. 
These are folate and other vitamins, ascorbate, and deoxyribonucleotides. 

There are two epithelial cells that participate in transcytosis in the intestine, M cells 
and enterocytes (adsorptive columnar cells). These cells are very different from one another 
10 and the capillary endothelial cell. Depending on the species, M cells comprise a variable but 
small percentage of the epithelia overlying organized mucosal-associated lymphoid tissue, 
making them a very minor cell population in the gastrointestinal tract. The transcytotic 
route across M cells is thought to be part of the mechanism by which antigens are routinely 
sampled along the entire mucosal surface. Not surprisingly, numerous pathogens have 
15 evolved mechanisms to exploit the transcytotic process as a means to invade and 
disseminate before a strong enough immune response can be mounted. 

Absorptive enterocytes are simple columnar cells with several apical features in 
addition to their brush borders. Clathrin-coated pits are present at the base of microvilli, and 
a thick glycocalyx composed of integral membrane proteins with glycosaminoglycan side 
20 chains emanates from the microvillar membrane. This latter structural feature as well as the 
rigidity of the microvilli are thought to prohibit microorganisms from attaching and 
invading enterocytes. The intracellular organization of these columnar epithelial cells is also 
polarized, with basally located nuclei, supranuclear Golgi, and an abundance of 
pleomorphic membrane compartments underlying the terminal web of the brush border. 
25 The basolateral-to-apical length of this cell is -20 versus 0.2 ^tm for a capillary endothelial 
cell, making the transcytotic route across enterocytes potentially much longer. Furthermore, 
microtubules are an important structural element of the transcytotic pathway in enterocytes, 
but not in M or endothelial cells. 

Transcytosis also occurs in the upper regions of the respiratory tract and has been 
30 demonstrated with two vector systems, plgA-R and FcRn, but others could exist. Secretory 
IgA is a known constituent of the lung's immune defense system, with bronchial epithelial 
cells carrying out basolateral-to-apical transport of dlgA, which is secreted by local plasma 
cells in underlying lymphoid tissue. Albumin, which is found in lung fluid, is endocytosed 
specifically at the apical surface of airway epithelia but is then subsequently degraded. At 
35 the alveolar level, the question of whether albumin is transcytosed intact is uncertain. 
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5 The methods and compositions described herein can be used to deliver heterologous 

nucleic acids to certain tissues. As used herein, the term "nucleic acid" refers to single-or 
multiple stranded molecules which may be DNA or RNA, or any combination thereof, 
including modifications to those nucleic acids. The nucleic acid may represent a coding 
strand or its complement, or any combination thereof. Nucleic acids may be identical in 

1 0 sequence to the sequences which are naturally occurring for any of the novel genes 

discussed herein or may include alternative codons which encode the same amino acid as 
those provided herein, including that which is found in the naturally occurring sequence. 
These nucleic acids can also be modified from their typical structure. Such modifications 
include, but are not limited to, methylated nucleic acids, the substitution of a non-bridging 

15 oxygen on the phosphate residue with either a sulfur (yielding phosphorothioate 

deoxynucleotides), selenium (yielding phosphorselenoate deoxynucleotides), or methyl 
groups (yielding methylphosphonate deoxynucleotides). 

As used herein, the term "isolated" refers to a nucleic acid separated or significantly 
free from at least some of the other components of the naturally occurring organism, for 

20 example, the cell structural components or viral components commonly found associated 
with nucleic acids in the environment of the virus and/or other nucleic acids. The isolation 
of the native nucleic acids can be accomplished, for example, by techniques such as cell 
lysis followed by phenol plus chloroform extraction, followed by ethanol precipitation of the 
nucleic acids. The nucleic acids of this invention can be isolated from cells according to any 

25 of many methods well known in the art. 

The AAV vectors disclose herein can comprise a heterologous nucleic acid 
functionally linked to the promoter. The term "heterologous" is used herein to refer to a 
nucleic acid which is derived from a different cell, tissue or organism. The nucleic acid can 
encode a polypeptide or protein or an antisense RNA, for example. By "functionally linked" 

30 is meant such that the promoter can promote expression of the heterologous nucleic acid, as 
is known in the art, such as appropriate orientation of the promoter relative to the 
heterologous nucleic acid. Fu rthennoc e. heterologous nucleic acid preferably has all 
appropriate sequences for expression of the nucleic acid, as known in the art, to functionally 
encode, i.e., allow the nucleic acid to be expressed. The nucleic acid can include, for 

35 example, expression control sequences, such as an enhancer, and necessary information 

10 
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5 processing sites, such as ribosome binding sites, RNA splice sites, polyadenylation sites, 
and transcriptional terminator sequences. 

The heterologous nucleic acid can encode beneficial proteins that replace missing or 
defective proteins required by the subject into which the vector in transferred or can encode 
a cytotoxic polypeptide that can be directed, e.g., to cancer cells or other cells whose death 

10 would be beneficial to the subject. The heterologous nucleic acid can also encode antisense 
RNAs that can bind to, and thereby inactivate, mRNAs made by the subject that encode 
harmful proteins. In one embodiment, antisense polynucleotides can be produced from a 
heterologous expression cassette in an AAV4 viral construct where the expression cassette 
contains a sequence that promotes cell-type specific expression (Wirak et aL 9 1991. EMBO 

1 5 1 0:289). For general methods relating to antisense polynucleotides, see Antisense RNA and 
DNA, D. A. Melton, Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1988). 

Examples of heterologous nucleic acids which can be administered to a cell or 
subject as part of the present AAV4 vector can include, but are not limited to the following: 
nucleic acids encoding therapeutic agents, such as tumor necrosis factors (TNF), such as 

20 TNF- a; interferons, such as interferon-a, interferon-!}, and interferon-y; interleukins, such as 
IL-1, IL-ip, and ILs -2 through -14; GM-CSF; adenosine deaminase; cellular growth factors, 
such as lymphokines; soluble CD4; Factor VIII; Factor DC; T-cell receptors; LDL receptor; 
ApoE; ApoC; alpha- 1 antitrypsin; ornithine transcarbamylase (OTC); cystic fibrosis 
transmembrane receptor (CFTR); insulin; Fc receptors for antigen binding domains of 

25 antibodies, such as immunoglobulins; and antisense sequences which inhibit viral 

replication, such as antisense sequences which inhibit replication of hepatitis B or hepatitis 
non-A, non-B virus. The nucleic acid is chosen considering several factors, including the 
cell to be transfected. Where the target cell is a blood cell, for example, particularly useful 
nucleic acids to use are those which allow the blood cells to exert a therapeutic effect, such 

30 as a gene encoding a clotting factor for use in treatment of hemophilia. Furthermore, the 

nucleic acid can encode more than one gene product, limited only, if the nucleic acid is to be 
packaged in a capsid, by the size of nucleic acid that can be packaged. 

The term "polypeptide" as used herein refers to a polymer of amino acids and 
includes full-length proteins and fragments thereof. Thus, "protein," polypeptide," and 

35 "peptide" are often used interchangeably herein. Substitutions can be selected by known 
parameters to be neutral (see, e.g.. Robinson WE Jr, and Mitchell WM., 1990. AIDS 
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5 4:S15 1 -S 162). As will be appreciated by those skilled in the art, the invention also includes 
those polypeptides having slight variations in amino acid sequences or other properties. 
Such variations may arise naturally as allelic variations (e.g., due to genetic polymorphism) 
or may be produced by human intervention (e.g., by mutagenesis of cloned DNA 
sequences), such as induced point, deletion, insertion and substitution mutants. Minor 

10 changes in amino acid sequence are generally preferred, such as conservative amino acid 

replacements, small internal deletions or insertions, and additions or deletions at the ends of 
the molecules. Substitutions may be designed based on, for example, the model of Dayhoff, 
et al. (in Atlas of Protein Sequence and Structure 1978, Nat'l Biomed. Res. Found., 
Washington, D.C.). These modifications can result in changes in the amino acid sequence, 

1 5 provide silent mutations, modify a restriction site, or provide other specific mutations. 

The term "epithelia" is used herein to refer to cells which are linked tightly together 
by intercellular junctions to form a planar sheet. These sheets of cells form a barrier between 
two compartments. Epithelia therefore line all surfaces and cavities (including skin, 
peritoneum, linings of the intestine, airways, genitourinary tracts, glands, and blood vessels. 

20 An epithelium has a free or apical surface facing the environment, or lumen of a 

cavity, and a basal surface facing the underlying connective tissue. The boundary between 
the basal surface of an epithelium and the underlying connective tissue is usually very sharp, 
and is the site where the basal lamina (BL) is present. Most BL are too thin to be seen with 
the light microscope. However, the BL, together with a thin layer of connective tissue, is 

25 often times seen at the epithelial/connective tissue interface. This composite layer, visible 
with the light microscope, was initially called the Basement Membrane. Application of the 
electron microscope revealed that, in most cases, this Basement Membrane actually 
consisted of the true basal lamina (lamina lucida plus lamina densa), along with a layer of 
adherent connective tissue. 

30 For convenience of description, epithelia are classified into different types based on 

the number of cell layers and the cell shape. 

Epithelia which are 1 cell layer thick are called "simple" epithelia. Thus, each cell 
rests on the basal lamina, but also has a surface facing the lumen/outside world. Epithelia 
which are 2 or more cell layers thick are called "stratified" epithelia. In stratified epithelia, 

35 the basal layer of cells rests on the basal lamina, but subsequent layers do not, and are 

simply stacked on top of the basal layer. The cells of the most superficial layer have a free 
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5 surface, "squamous" cells are very flat, like a fried egg, where the yolk is the nucleus. The 
nucleus is distinctly flattened, the cell is often so thin that this flattened nucleus bulges the 
cell surface outward, "cuboidal" cells range from true cuboidal where the cell is about as 
high as it is wide, to a flattened cuboidal where the cell is wider than high. In cuboidal cells 
the nucleus is usually round, and not flattened as in squamous, "columnar" cells are 2 or 

10 more times as high as wide. Nucleus is usually elongated in the long axis of the cell. 

Squamous cells form the lining of cavities such as the mouth, blood vessels, heart 
and lungs and make up the outer layers of the skin. Cuboidal epithelium is found in glands 
and in the lining of the kidney tubules as well as in the ducts of the glands. They also 
constitute the germinal epithelium which produces the egg cells in the female ovary and the 

1 5 sperm cells in the male testes. Columnar epithelium forms the lining of the stomach and 
intestines. Some columnar cells are specialized for sensory reception such as in the nose, 
ears and the taste buds of the tongue. 

Ciliated columnar epithelial cells posses fine hair-like outgrowths, cilia on their free 
surfaces. These cilia are capable of rapid, rhythmic, wavelike beatings in a certain direction. 

20 Ciliated epithelium is usually found in the air passages like the nose. It is also found in the 
uterus and Fallopian tubes of females. 

Columnar epithelium with goblet cells is called glandular epithelium. Some parts of 
the glandular epithelium consist of such a large number of goblet cells that there are only a 
few normal epithelial cells left. Columnar and cuboidal epithelial cells often become 

25 specialized as gland cells which are capable of synthesizing and secreting certain substances 
such as enzymes, hormones, milk, mucus, sweat, wax and saliva. Unicellular glands consist 
of single, isolated glandular cells such as the goblet cells. Sometimes a portion of the 
epithelial tissue becomes invagtnated and a multicellular gland is formed. Multicellular 
glands are composed of clusters of cells. Most glands are multicellular including the salivary 

30 glands. 

Where body linings have to withstand wear and tear, the epithelia are composed of 
ssa&saggJ layers of cells and are then called compound or stratified epithelium. The top cells 
are flat and scaly and it may or may not be keratinized (i.e. containing a tough, resistant 
protein called keratin). The mammalian skin is an example of dry, keratinized, stratified 
35 epithelium. The lining of the mouth cavity is an example of an unkeratinized, stratified 
epithelium. 

13 

233334_17.DOC 



ATTORNEY DOCKET NO. 14014.0427U1 



In vitro Cell Models of Transcy tosis 

The use of in vitro cell models to study transcytosis has many advantages over in 
vivo systems. First, variation among animals is eliminated, as is the confounding issue of 
cargo possibly being modified or endocytosed by cell types other than the one under study. 

10 Moreover, in vitro systems can be manipulated in ways not possible in vivo, allowing 
investigators to measure the effects of different variables (e.g., temperatures, 
pharmacological agents, etc.) with greater precision and to explore the molecular 
mechanisms of transcytosis. 

The integrity of the monolayer is obviously vital to every study of transcytosis, and 

1 5 there are different methods for assessing it. Transepithelial electrical resistance (TER) 

measurements are commonly used as an indication of tight junction integrity in a monolayer, 
and commercial instruments are available for these measurements. 

Caco-2 cells, human primary colon carcinoma cells, are a well studied model of 
intestinal absorptive enterocytes. They are the most commonly used intestinal cell line 

20 because they differentiate furthest along the cryptto-villus axis and are the easiest to 

transfect. Caco-2 cells have been especially used to model transcytosis of bacteria, which 
can cross barrier epithelia in the gut and brain (Zhang JR, et al., 2000. Cell 102(6):827-37), 
incorporated herein by reference. 

There is little evidence for in vivo transcytosis of macromolecular cargo in kidney. 

25 Nonetheless, MDCK cells, which are derived from dog kidney, are the most-studied 

epithelial cell model and have been used extensively to study transcytosis. These cells were 
originally developed by nephrologists for permeability and electrical studies. Their 
subsequent use by cell biologists for studies of the formation of tight junctions, 
establishment of polarity, and vesicle traffic have popularized MDCK cells. An advantage is 

30 that MDCK cells are easily cultured, easily transfected, and become polarized 3-5 days after 
seeding. They were used in the now classical studies showing that enveloped viruses bud in 
a polarized fashion and that the newly synthesized viral membrane glycoproteins are 
targeted directly from the TON to the appropriate PM domain. Furthermore, much of the 
current understanding of the IgA transcytotic pathway and the sorting signals in the plgA-R 

35 comes from the elegant studies performed in MDCK cells. Two MDCK strains with very 
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5 different features were identified some time ago. The MDCK I cell has a high TER and 
characteristics reminiscent of the renal collecting duct, whereas the more commonly used 
MDCK II strain, whose TER is one order of magnitude lower than that of MDCK I cells, 
has phenotypic features closer to those of the renal proximal tubule. 

Both primary cells and cell lines, alone and in coculture with endothelial cells, are 
10 being used to study transcytosis in the lung. Clonetics bronchial/tracheal epithelial cell 

systems contain normal human bronchial/treacheal epithelial cells. This cell system has been 
used for experimental applications in cancer research, respiratory disease, cellular function 
and differentiation. 

The Clonetics® bovine Brain Microvascular Endothelial Cell System (bMVEC-B) is 

1 5 a model of the "Blood Brain Barrier". The system is designed to significantly improve a 
researcher's ability to study active and passive transport of drugs across the blood brain 
barrier, to study brain endothelial cell tight junctions, and to study the basic biology of brain 
microvascular endothelial cells (Schinket AH ,1999. Advanced Drug Delivery Reviews 
36:179-194; Tsukita S. et al., 1998. Molecular dissection of tight junctionsroccluding and 

20 ZO-1 in Introduction to the Blood -Brain Barrier. Edited by William M Partridge; Inglis et 
al., 2004. Brain Research 998: 218-229), each of which is incorporated by reference for its 
teaching of in vitro endothelial cell modeling of the blood-brain barrier. 

Endometrial cells form an important barrier layer in the genitourinary tract. The 
cells used to model this system were developed by Kyo et al. and are derived from primary 

25 cells immortalized by the addition of the papillomiavirus E6/E7 genes and human 

telomerase reverse transcriptase. The isolated cells have a normal chromosomes and retain 
their responsiveness to sex-steriod hormones, exhibit glandular structure on three 
dimensional culture , and lack a transformed phenotype (Kyo S, et al. Am J Pathol., 2003. 
163(6):2259-69), incorporated herein by reference for its teaching of this endometrial 

30 model. 



UTILITY 

The use of AAVs to deliver genes to the lung would be of benefit in genetic diseases 
like cystic fibrosis, pseudohypoaldosteronism, and immotile cilia syndrome. Furthermore, 
35 delivering genes to the lung would be of impact in several non-genetic diseases. For 
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5 example, delivering genes that make antibiotic like peptides to the cells underlying the 
epithelia would be useful to prevent or treat bronchitis; delivering genes that make growth 
factors would be of value in common diseases like chronic bronchitis. Also, AAVs could be 
used to deliver genes that may play a role in asthma, like IL-10, or antibodies to IgE and 
interleukins. The use of an AAV vector to deliver genes through the alveolar epithelia 

1 0 would be of benefit in genetic diseases like alpha- 1 -antitrypsin deficiency. Furthermore, 
delivering genes through the alveolar epithelia would be of significance in several 
pulmonary non-genetic diseases. For example, delivering genes that make antibiotic like 
peptides would be useful to prevent or treat pneumonia (perhaps of antibiotic-resistant 
organisms); delivering genes that make growth factors would be of value in emphysema; 

1 5 delivering genes that over-express the epithelial sodium channel or the Na-K ATPase could 
be used to treat cardiogenic and non-cardiogenic pulmonary edema; delivering genes that 
have an anti-fibrosis effect like interferon for pulmonary fibrosis would also be useful. Also, 
AAVs could be used to deliver genes that may have a systemic effect like anti-hypertension 
drugs, insulin, coagulation factors, antibiotics, growth factors, hormones and others. 

20 The use of AAVs to deliver genes to the central nervous system (CNS)/ brain would 

be of benefit in neurological diseases, including Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, schizophrenia, mania, dementia, paranoia, 
obsessive compulsive disorder, panic disorder, learning disabilities, ALS, triplet expansions 
diseases, psychoses, autism, lysosomal storage diseases, Gaucher's disease, Hurler's 

25 disease, Kxabbe's disease, battens disease, and altered behaviors (e.g., disorders in feeding, 
sleep patterns, balance, and perception). 

The use of AAVs to deliver genes to the gastrointestinal system/ gut would be of 
benefit in treatment of diseases and/or Gastrointestinal Disorders such as colon cancers, 
inflammatory bowel disease, diabetes, or Crohn's disease. 

30 The use of AAVs to deliver genes to the genitourinary system would be of benefit in 

treatment of diseases of the female reproductive tract, molecular defects in implantation 
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5 disorders, and gynecological cancers. These methods would also have contraceptive 
applications. 

The use of AAVs to deliver genes to the kidney would be of benefit in treatment of 
inherited renal disorders such as polycystic kidney disease, Alport's syndrome, hereditary 
nephritis, primary hyperoxaluria, and cystinuria. 
10 The use of AAVs for wide-spread delivery of genes across blood vessels into the 

muscle would be of benefit in neuromuscular diseases like muscular dystrophy and 
Cardiovascular Disorders such as heart disease, restenosis, atherosclerosis, myocarditis, 
stoke, angina, or thrombosis. 

The use of AAVs for wide-spread delivery of genes across blood vessels into any/all 
1 5 tissues of a subject would be of benefit in the treatment of certain cancers (e.g., gastric, 
ovarian, lung, bladder, liver, and breast). 

The use of AAVs for wide-spread delivery of genes across blood vessels into any/all 
tissues of a subject would be of benefit in the treatment of certain inflammatory disorders, 
including, but not limited to, adrenalitis, alveolitis, angiocholecystitis, appendicitis, 
20 balanitis, blepharitis, bronchitis, bursitis, carditis, cellulitis, cervicitis, cholecystitis, 

chorditis, cochlitis, colitis, conjunctivitis, cystitis, dermatitis, diverticulitis, encephalitis, 
endocarditis, esophagitis, eustachitis, fibrositis, folliculitis, gastritis, gastroenteritis, 
gingivitis, glossitis, hepatosplenitis, keratitis, labyrinthitis, laryngitis, lymphangitis, mastitis, 
media otitis, meningitis, metritis, mucitis, myocarditis, myosititis, myringitis, nephritis, 
25 neuritis, orchitis, osteochondritis, otitis, pericarditis, peritendonitis, peritonitis, pharyngitis, 
phlebitis, poliomyelitis, prostatitis, pulpitis, retinitis, rhinitis, salpingitis, scleritis, 
sclerochoroiditis, scrotitis, sinusitis, spondylitis, steatitis, stomatitis, synovitis, syringitis, 
tendonitis, tonsillitis, urethritis, and vaginitis; and disorders that are characterized by 
inflammation such as hepatitis, rheumatoid arthritis, gout, trauma, pancreatitis, sarcoidosis, 
30 dermatitis, renal ischemia-reperfusion injury, Grave's disease, systemic lupus erythematosus, 
diabetes mellitus, and allogenic transplant rejection. 

The use of AAVs for wide-spread delivery of genes across blood vessels into any/ail 
tissues of a subject would be of benefit in the treatment of other diseases, syndromes and 
conditions, such as adenosine deaminase deficiency, sickle cell deficiency, thalassemia, 
35 hemophilia, diabetes, phenylketonuria, growth disorders, and defects of the immune system. 
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5 BAAV 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier of the lung, comprising delivering to the lung a BAAV vector comprising the nucleic 
acid. In one aspect of the method, the epithelial barrier comprises human bronchial, 
alveolar, tracheal or upper airway epithelial cells. Thus, disclosed is a method of delivering 

10 a heterologous nucleic acid across human airway epithelial cells, comprising delivering to 
the cells a BAAV vector comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the brain, comprising delivering to the brain a BAAV vector comprising the 
nucleic acid. In one aspect of the method, the epithelial barrier comprises human cerebral 

1 5 microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood brain 

barrier. Thus, disclosed is a method of delivering a heterologous nucleic acid across human 
cerebral microvascular endothelial cells, comprising delivering to the cells a BAAV vector 
comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across the epithelial 

20 barrier of blood vessels into the muscle, comprising delivering to the blood stream a BAAV 
vector comprising the nucleic acid. In one aspect of the method, the epithelial barrier 
comprises human vascular endothelial cells. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the genitourinary tract, comprising delivering to the genitourinary tract a BAAV 

25 vector comprising the nucleic acid genitourinary tract. In one aspect of the method, the 

epithelial barrier comprises human endometrial or urinary epithelial cells. Thus, disclosed is 
a method of delivering a heterologous nucleic acid across human endometrial epithelial 
cells, comprising delivering to the cells a BAAV vector comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 

30 barrier in the kidney, comprising delivering to the genitourinary tract a BAAV vector 

comprising the nucleic acid genitourinary tracts In one aspect of the method, the epithelial 
barrier comprises human renal collecting ducts or proximal tubules. Thus, disclosed is a 
method of delivering a heterologous nucleic acid across human kidney epithelial cells, 
comprising delivering to the cells a BAAV vector comprising the nucleic acid. 

35 Disclosed is a method of transcytosing lung epithelial cells of a subject comprising 

contacting the lung epithelial cells of the subject with a BAAV vector comprising a 
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5 heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
bronchial, tracheal, or upper airway epithelial cells. 

Disclosed is a method of transcytosing CNS epithelial cells of a subject comprising 
contacting the CNS epithelial cells of the subject with a BAAV vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
1 0 cerebral microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

Disclosed is a method of transcytosing vascular epithelial cells of a subject 
comprising contacting the vascular epithelial cells of the subject with a BAAV vector 
comprising a heterologous nucleic acid. In one aspect of the method, the epithelial cells are 

1 5 human vascular endothelial cells of the blood brain barrier. 

Disclosed is a method of transcytosing genitourinary tract epithelial cells of a subject 
comprising contacting the genitourinary tract epithelial cells of the subject with a BAAV 
vector comprising a heterologous nucleic acid. In one aspect of the method, the epithelial 
cells are human endometrial or urinary tract epithelial cells. 

20 Disclosed is a method of transcytosing genitourinary tract epithelial cells of a subject 
comprising contacting the kidney epithelial cells of the subject with a BAAV vector 
comprising a heterologous nucleic acid. In one aspect of the method, the epithelial cells are 
human renal collecting ducts or proximal tubules 

25 AAV5 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the gut, comprising delivering to the gut an AAV5 vector comprising the nucleic 
acid. In one aspect of the method, the epithelial barrier comprises human absorptive 
enterocytes or M cells. Thus, disclosed is a method of delivering a heterologous nucleic acid 

30 across human gut epithelial cells enterocytes, comprising delivering to the cells an AAV 5 
vector comprising-then uc l eic -acid. 

Disclosed is a method of transcytosing gut epithelial cells of a subject comprising 
contacting the gut epithelial cells of the subject with an AAV 5 vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 

35 absorptive enterocytes. 
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5 AAV4 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the gut, comprising delivering to the gut an AAV4 vector comprising the nucleic 
acid. In one aspect of the method, the epithelial barrier comprises human absorptive 
enterocytes or M cells. Thus, disclosed is a method of delivering a heterologous nucleic acid 

10 across human gut epithelial cells enterocytes, comprising delivering to the cells an AAV4 
vector comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the lung, comprising delivering to the lung an AAV4 vector comprising the 
nucleic acid. In one aspect of the method, the epithelial barrier comprises human bronchial, 

15 tracheal, or upper airway epithelial cells. Thus, disclosed is a method of delivering a 

heterologous nucleic acid across human airway epithelial cells, comprising delivering to the 
cells an AAV4 vector comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the CNS, comprising delivering to the CNS an AAV4' vector comprising the 

20 nucleic acid. In one aspect of the method, the epithelial barrier comprises human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood brain 
barrier. Thus, disclosed is a method of delivering a heterologous nucleic acid across human 
cerebral microvascular endothelial cells, comprising delivering to the cells a AAV4 vector 
comprising the nucleic acid. 

25 Disclosed is a method of delivering a heterologous nucleic acid across the epithelial 

barrier of blood vessels into the muscle, comprising delivering to the blood stream an AAV4 
vector comprising the nucleic acid. In one aspect of the method, the epithelial barrier 
comprises human vascular endothelial cells of the blood brain barrier. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 

30 barrier in the genitourinary tract, comprising delivering to the genitourinary tract an AAV4 
vector comprising the nucleic acid Tn one aspect of the method, the epithelial barrier 
comprises human endometruaJ-or udLaja^gs^^gAial cells. Thus, disclosed is a method of 
delivering a heterologous nucleic acid across human endometrial epithelial cells, comprising 
delivering to the cells an AAV4 vector comprising the nucleic acid. 

35 Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 

barrier in the kidneys, comprising delivering to the kidneys an AAV4 vector comprising the 
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5 nucleic acid. In one aspect of the method, the epithelial barrier comprises human renal 
collecting ducts or proximal tubules. Thus, disclosed is a method of delivering a 
heterologous nucleic acid across human kidney epithelial cells, comprising delivering to the 
cells an AAV4 vector comprising the nucleic acid. 

Disclosed is a method of transcytosing lung epithelial cells of a subject comprising 
10 contacting the lung epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
bronchial, tracheal, or upper airway epithelial cells. 

Disclosed is a method of transcytosing CNS epithelial cells of a subject comprising 
contacting the CNS epithelial cells of the subject with an AAV4 vector comprising a 
1 5 heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 

cerebral microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

Disclosed is a method of transcytosing vascular epithelial cells of a subject 
comprising contacting the vascular epithelial cells of the subject with an AAV4 vector 

20 comprising a heterologous nucleic acid. In one aspect of the method, the epithelial cells are 
vascular endothelial cells of the blood brain barrier. 

Disclosed is a method of transcytosing genitourinary tract epithelial cells of a subject 
comprising contacting the genitourinary epithelial cells of the subject with an AAV4 vector 
comprising a heterologous nucleic acid. In one aspect of the method, the epithelial cells are 

25 human endometrial or urinary epithelial cells. 

Disclosed is a method of transcytosing kidney epithelial cells of a subject comprising 
contacting the kidney epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human renal 
collecting ducts or proximal tubules 

30 Disclosed is a method of transcytosing gut epithelial cells of a subject comprising 

contacting the CNS epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
absorptive enterocytes. 
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5 Inhibition of Transcvtosis to Increase Transduction 

Described herein is a method for re-directing virus that enters a cell by transcytosis 
to result in transduction of the cell by blocking exocytosis. Thus, provided is a method of 
improving the efficiency of nucleic acid delivery to epithelial cells, comprising delivering to 
the cells an inhibitor of exocytosis and an AAV vector containing the nucleic acid. Also 
10 provided is a method for transducing cells that have transcytosis activity but are normally 
resistant to transduction comprising administering to the cells inhibitors of exocytosis. 

In one aspect of the methods, the AAV vector is derived from AAV4, AAV5, or 
BAAV. In a further aspect of the methods, the epithelial cell barriers are located in the 
kidney, gut, lung or vascular endothelium 
15 Thus, disclosed is a method of delivering a heterologous nucleic acid to human 

airway epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and 
an AAV4 vector comprising the nucleic acid. 

Further disclosed is a method of delivering a heterologous nucleic acid to human 
kidney epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and 
20 an AAV4 vector comprising the nucleic acid. 

Further disclosed is a method of delivering a heterologous nucleic acid to human 
vascular endothelial cells, comprising delivering to the cells and an inhibitor of exocytosis 
and an AAV4 vector comprising the nucleic acid. 

Further disclosed is a method of delivering a heterologous nucleic acid to human 
25 airway epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and 
a BAAV vector comprising the nucleic acid. 

Further disclosed is a method of delivering a heterologous nucleic acid to human 
kidney epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and 
a BAAV vector comprising the nucleic acid. 
30 Further disclosed is a method of delivering a heterologous nucleic acid to human 

vascular endothelial cells, comprising delivering to the cells and an inhibitor of exocytosis 
and a BAAV vector comprisiag *fee zzsekmrnmrnd. 

Further disclosed is a method of delivering a heterologous nucleic acid to human gut 
epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and an 
35 AAV5 vector comprising the nucleic acid. 
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5 In one aspect of the disclosed methods, the inhibitors of exocytosis are chemical 

modifiers. In a further aspect of the methods, the chemical modifier is tannic acid, wherein 
the tannic acid is delivered to the basal lateral surface of the epithelial cells. 

Compositions and methods for making AAV4 vectors 

1 0 Compositions and methods for making and using AA V4 vectors have been 

previously described in U.S. Patent No. 6,468,524, incorporated herein by reference for this 
teaching. 

Provided is the nucleotide sequence of the adeno-associated virus 4 (AAV4) genome 
and vectors and particles derived therefrom. Specifically, provided is a nucleic acid vector 

1 5 comprising a pair of AAV4 inverted terminal repeats (ITRs) and a promoter between the 
inverted terminal repeats. The AAV4 ITRs are exemplified by the nucleotide sequence set 
forth in SEQ ID NO:6 and SEQ ED NO:20; however, these sequences can have minor 
modifications and still be contemplated to constitute AAV4 ITRs. The nucleic acid listed in 
SEQ ID NO:6 depicts the ITR in the "flip" orientation of the ITR. The nucleic acid listed in 

20 SEQ ID NO:20 depicts the ITR in the "flop" orientation of the ITR. Minor modifications in 
an ITR of either orientation are those that will not interfere with the hairpin structure formed 
by the AAV4 ITR as described herein and known in the art. Furthermore, to be considered 
within the term "AAV4 ITRs" the nucleotide sequence must retain the Rep binding site 
described herein and exemplified in SEQ ED NO:6 and SEQ ID NO:20, i.e., it must retain 

25 one or both features described herein that distinguish the AAV4 ITR from the AAV2 ITR: 
(1) four (rather than three as in AAV2) "GAGC" repeats and (2) in the AAV4 ITR Rep 
binding site the fourth nucleotide in the first two "GAGC" repeats is a T rather than a C. 

The promoter can be any desired promoter, selected by known considerations, such 
as the level of expression of a nucleic acid functionally linked to the promoter and the cell 

30 type in which the vector is to be used. Promoters can be an exogenous or an endogenous 

promoter. Promoters can include, for example, known strong promoters such as S V40 or the 
inducible metallothionein promoter, or an AAV promoter, such as an AAV p5 promoter. 
Additional examples of promoters include promoters derived from actin genes, 
immunoglobulin genes, cytomegalovirus (CMV), adenovirus, bovine papilloma virus, 

35 adenoviral promoters, such as the adenoviral major late promoter, an inducible heat shock 
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5 promoter, respiratory syncytial virus, Rous sarcomas virus (RSV), etc. Specifically, the 
promoter can be AAV2 p5 promoter or AAV4 p5 promoter. More specifically, the AAV4 
p5 promoter can be about nucleotides 130 to 291 of SEQ ID NO: 1. Additionally, the p5 
promoter may be enhanced by nucleotides 1-130. Furthermore, smaller fragments of p5 
promoter that retain promoter activity can readily be determined by standard procedures 

10 including, for example, constructing a series of deletions in the p5 promoter, linking the 
deletion to a reporter gene, and determining whether the reporter gene is expressed, i.e., 
transcribed and/or translated. 

The present invention also contemplates any unique fragment of these AAV4 nucleic 
acids, including the AAV4 nucleic acids set forth in SEQ ID NOs: 1, 3, 5, 6, 7, 12-15, 17 

1 5 and 19. To be unique, the fragment must be of sufficient size to distinguish it from other 
known sequences, most readily determined by comparing any nucleic acid fragment to the 
nucleotide sequences of nucleic acids in computer databases, such as GenBank. Such 
comparative searches are standard in the art. Typically, a unique fragment useful as a primer 
or probe will be at least about 8 or 10 to about 20 or 25 nucleotides in length, depending 

20 upon the specific nucleotide content of the sequence. Additionally, fragments can be, for 
example, at least about 30, 40, 50, 75, 100, 200 or 500 nucleotides in length. The nucleic 
acid can be single or double stranded, depending upon the purpose for which it is intended. 

The present invention further provides an AAV4 Capsid polypeptide or a unique 
fragment thereof. AAV4 capsid polypeptide is encoded by ORF 2 of AAV4. Specifically, 

25 provided is an AAV4 Capsid protein comprising the amino acid sequence encoded by 

nucleotides 2260-4464 of the nucleotide sequence set forth in SEQ ID NO:l, or a unique 
fragment of such protein. The present invention also provides an AAV4 Capsid protein 
consisting essentially of the amino acid sequence encoded by nucleotides 2260-4464 of the 
nucleotide sequence set forth in SEQ ID NO:l, or a unique fragment of such protein. The 

30 present invention further provides the individual AAV4 coat proteins, VP1, VP2 and VP3. 
Thus, provided is an isolated polypeptide having the amino acid sequence set forth in SEQ 
ID NO:4 (VP1). The present invention additionally provides an isolated polypeptide having 
the amino acid sequence set forth in SEQ ID NO: 16 (VP2). The present invention also 
provides an isolated polypeptide having the amino acid sequence set forth in SEQ ED NO: 18 

35 (VP3). By "unique fragment thereof 1 is meant any smaller polypeptide fragment encoded by 
any AAV4 capsid gene that is of sufficient length to be unique to the AAV4 Capsid protein. 
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5 Substitutions and modifications of the amino acid sequence can be made as described above 
and, further, can include protein processing modifications, such as glycosylation, to the 
polypeptide. However, an AAV4 Capsid polypeptide including all three coat proteins will 
have at least about 63% overall homology to the polypeptide encoded by nucleotides 2260- 
4464 of the sequence set forth in SEQ ED NO: 1. The protein can have about 65%, about 

10 70%, about 75%, about 80%, about 85%, about 90%, about 95% or even 100% homology to 
the amino acid sequence encoded by the nucleotides 4467of the sequence set forth in SEQ 
ED NO:l. An AAV4 VP2 polypeptide can have at least about 58%, about 60%, about 70%, 
about 80%, about 90% about 95% or about 100% homology to the amino acid sequence set 
forth in SEQ ID NO: 16. An AAV4 VP3 polypeptide can have at least about 60%, about 

1 5 70%, about 80%, about 90% about 95% or about 1 00% homology to the amino acid 
sequence set forth in SEQ ID NO: 18. 

The herein described AAV4 nucleic acid vector can be encapsidated in an AAV 
particle. In particular, it can be encapsidated in an AAV1 particle, an AAV2 particle, an 
AAV3 particle, an AAV4 particle, or an AAV5 particle by standard methods using the 

20 appropriate capsid proteins in the encapsidation process, as long as the nucleic acid vector 
fits within the size limitation of the particle utilized. The encapsidation process itself is 
standard in the art. 

An AAV4 particle is a viral particle comprising an AAV4 capsid protein. An AAV4 
capsid polypeptide encoding the entire VP1, VP2, and VP3 polypeptide can overall have at 

25 least about 63% homology to the polypeptide having the amino acid sequence encoded by 

nucleotides 2260*4464 set forth in SEQ ID NO:l (AAV4 capsid protein). The capsid protein 
can have about 70% homology, about 75% homology, 80% homology, 85% homology, 90% 
homology, 95% homology, 98% homology, 99% homology, or even 100% homology to the 
protein having the amino acid sequence encoded by nucleotides 2260-4464 set forth in SEQ 

30 ID NO: 1 . The particle can be a particle comprising both AAV4 and AAV2 capsid protein, 
i. e., a chimeric protein. Variations in the amino acid sequence of the AAV4 capsid protein 
*r>~ contemplated herein, as long as the resulting viral particle comprising the AAV4 capsid 
remains antigenically or immunologically distinct from AAV2, as can be routinely 
determined by standard methods. Specifically, for example, ELISA and Western blots can 

35 be used to determine whether a viral particle is antigenically or immunologically distinct 
from AAV2. Furthermore, the AAV4 viral particle preferably retains tissue tropism 
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5 distinction from AAV2, such as that exemplified in the examples herein, though an AAV 4 
chimeric particle comprising at least one AAV4 coat protein may have a different tissue 
tropism from that of an AAV4 particle consisting only of AAV4 coat proteins. 

An AAV4 particle is a viral particle comprising an AAV4 capsid protein. An AAV4 
capsid polypeptide encoding the entire VP1, VP2, and VP3 polypeptide can overall have at 

1 0 least about 63% homology to the polypeptide having the amino acid sequence encoded by 

nucleotides 2260-4467 set forth in SEQ ID NO:l (AAV4 capsid protein). The capsid protein 
can have about 70% homology, about 75% homology, 80% homology, 85% homology, 90% 
homology, 95% homology, 98% homology, 99% homology, or even 100% homology to the 
protein having the amino acid sequence encoded by nucleotides 2260-4467 set forth in SEQ 

1 5 ID NO: 1 . The particle can comprise only VP1 and VP3 and still stably transduce cells. 
The particle can be a particle comprising both AAV4 and AAV2 capsid protein, i.e., a 
chimeric protein. Variations in the amino acid sequence of the AAV4 capsid protein are 
contemplated herein, as long as the resulting viral particle comprising the AAV4 capsid 
remains antigenically or immunologically distinct from AAV2, as can be routinely 

20 determined by standard methods. Specifically, for example, ELISA and Western blots can 
be used to determine whether a viral particle is antigenically or immunologically distinct 
from AAV2. Furthermore, the AAV4 viral particle preferably retains tissue tropism 
distinction from AAV2, such as that exemplified in the examples herein, though an AAV4 
chimeric particle comprising at least one AAV4 coat protein may have a different tissue 

25 tropism from that of an AAV4 particle consisting only of AAV4 coat proteins. 

The invention further provides an AAV4 particle containing, i.e., encapsidating, a 
vector comprising a pair of AAV2 inverted terminal repeats. The nucleotide sequence of 
AAV2 ITRs is known in the art. Furthermore, the particle can be a particle comprising both 
AAV4 and AAV2 capsid protein, i.e., a chimeric protein. The vector encapsidated in the 

30 particle can further comprise an exogenous nucleic acid inserted between the inverted 
terminal repeats. 

The present invention Gj&hez ^^^m isolated nucleic acid comprising the 
nucleotide sequence set forth in SEQ ID NO:l (AAV4 genome). This nucleic acid, or 
portions thereof, can be inserted into other vectors, such as plasmids, yeast artificial 
35 chromosomes, or other viral vectors, if desired, by standard cloning methods. The present 
invention also provides an isolated nucleic acid consisting essentially of the nucleotide 
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5 sequence set forth in SEQ ID NO: 1 . The nucleotides of SEQ ID NO: 1 can have minor 
modifications and still be contemplated by the present invention. For example, 
modifications that do not alter the amino acid encoded by any given codon (such as by 
modification of the third, "wobble," position in a codon) can readily be made, and such 
alterations are known in the art. Furthermore, modifications that cause a resulting neutral 

1 0 amino acid substitution of a similar amino acid can be made in a coding region of the 

genome. Additionally, modifications as described herein for the AAV4 components, such as 
the ITRs, the p5 promoter, etc. are contemplated in this invention. 

The present invention additionally provides an isolated nucleic acid that selectively 
hybridizes with an isolated nucleic acid consisting essentially of the nucleotide sequence set 

1 5 forth in SEQ ID NO:l (AAV4 genome). The present invention further provides an isolated 
nucleic acid that selectively hybridizes with an isolated nucleic acid comprising the 
nucleotide sequence set forth in SEQ ID NO:l (AAV4 genome). By "selectively hybridizes" 
as used in the claims is meant a nucleic acid that specifically hybridizes to the particular 
target nucleic acid under sufficient stringency conditions to selectively hybridize to the 

20 target nucleic acid without significant background hybridization to a nucleic acid encoding 
an unrelated protein, and particularly, without detectably hybridizing to AAV2. Thus, a 
nucleic acid that selectively hybridizes with a nucleic acid of the present invention will not 
selectively hybridize under stringent conditions with a nucleic acid encoding a different 
protein, and vice versa. Therefore, nucleic acids for use, for example, as primers and probes 

25 to detect or amplify the target nucleic acids are contemplated herein. Nucleic acid fragments 
that selectively hybridize to any given nucleic acid can be used, e.g., as primers and or 
probes for further hybridization or for amplification methods {e.g., polymerase chain 
reaction (PCR), ligase chain reaction (LCR)). Additionally, for example, a primer or probe 
can be designed that selectively hybridizes with both AAV4 and a gene of interest carried 

30 within the AAV4 vector {i.e., a chimeric nucleic acid). 

The present invention further provides an isolated nucleic acid encoding an adeno- 
associated virus 4 Rep protein. The AAV4 Rep proteins are encoded by open reading frame 
(ORF) 1 of the AAV4 genome. The AAV4 Rep genes are exemplified by the nucleic acid 
set forth in SEQ ID NO: 3 (AAV4 ORF1), and include a nucleic acid consisting essentially 

35 of the nucleotide sequence set forth in SEQ ID NO:3 and a nucleic acid comprising the 

nucleotide sequence set forth in SEQ ID NO:3. The present invention also includes a nucleic 
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5 acid encoding the amino acid sequence set forth in SEQ ID NO: 2 (polypeptide encoded by 
AAV4 ORF1). However, the present invention includes that the Rep genes nucleic acid can 
include any one, two, three, or four of the four Rep proteins, in any order, in such a nucleic 
acid. Furthermore, minor modifications are contemplated in the nucleic acid, such as silent 
mutations in the coding sequences, mutations that make neutral or conservative changes in 
1 0 the encoded amino acid sequence, and mutations in regulatory regions that do not disrupt 
the expression of the gene. Examples of other minor modifications are known in the art. 
Further modifications can be made in the nucleic acid, such as to disrupt or alter expression 
of one or more of the Rep proteins in order to, for example, determine the effect of such a 
disruption; such as to mutate one or more of the Rep proteins to determine the resulting 
1 5 effect, etc. However, in general, a modified nucleic acid encoding all four Rep proteins will 
have at least about 90%, about 93%, about 95%, about 98% or 100% homology to the 
sequence set forth in SEQ ID NO:3, and the Rep polypeptide encoded therein will have 
overall about 93%, about 95%, about 98%, about 99% or 100% homology with the amino 
acid sequence set forth in SEQ ID NO:2. 
20 The present invention also provides an isolated nucleic acid that selectively 

hybridizes with a nucleic acid consisting essentially of the nucleotide sequence set forth in 
SEQ ID NO:3 and an isolated nucleic acid that selectively hybridizes with a nucleic acid 
comprising the nucleotide sequence set forth in SEQ ID NO:3. "Selectively hybridizing" is 
defined elsewhere herein. 
25 The present invention also provides each individual AAV4 Rep protein and the 

nucleic acid encoding each. Thus provided is the nucleic acid encoding a Rep 40 protein, 
and in particular an isolated nucleic acid comprising the nucleotide sequence set forth in 
SEQ ID NO: 12, an isolated nucleic acid consisting essentially of the nucleotide sequence set 
forth in SEQ ID NO: 12, and a nucleic acid encoding the adeno-associated virus 4 Rep 
30 protein having the amino acid sequence set forth in SEQ ID NO:8. The present invention 
also provides the nucleic acicLencoding aJLep 52 protein, and in particular an isolated 
nucleic acid comprising the nucleotide sequence set forth in SEQ ID NO: 13, an isolated 
nucleic acid consisting essentially of the nucleotide sequence set forth in SEQ ID NO: 13, 
and a nucleic acid encoding the adeno-associated virus 4 Rep protein having the amino acid 
35 sequence set forth in SEQ ID NO:9. The present invention further provides the nucleic acid 
encoding a Rep 68 protein, and in particular an isolated nucleic acid comprising the 
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5 nucleotide sequence set forth in SEQ ID NO: 14^ an isolated nucleic acid consisting 
essentially of the nucleotide sequence set forth in SEQ ID NO: 14, and a nucleic acid 
encoding the adeno-associated virus 4 Rep protein having the amino acid sequence set forth 
in SEQ ED NO: 10. And, further, provided is the nucleic acid encoding a Rep 78 protein, and 
in particular an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 

10 NO: 15, an isolated nucleic acid consisting essentially of the nucleotide sequence set forth in 
SEQ ID NO: 15, and a nucleic acid encoding the adeno-associated virus 4 Rep protein 
having the amino acid sequence set forth in SEQ ED NO:l 1. As described elsewhere herein, 
these nucleic acids can have minor modifications, including silent nucleotide substitutions, 
mutations causing neutral amino acid substitutions in the encoded proteins, and mutations in 

15 control regions that do not or minimally affect the encoded amino acid sequence. 

The present invention further provides a nucleic acid encoding the entire AAV4 
Capsid polypeptide. Specifically, provided is a nucleic acid having the nucleotide sequence 
set for the nucleotides 2260-4467 of SEQ ID NO:l. Furthermore, provided is a nucleic acid 
encoding each of the three AAV4 coat proteins, VP1, VP2, and VP3. Thus, provided is a 

20 nucleic acid encoding AAV4 VP1, a nucleic acid encoding AAV4 VP2, and a nucleic acid 
encoding AAV4 VP3. Thus, provided is a nucleic acid encoding the amino acid sequence 
set forth in SEQ ID NO:4 (VP1); a nucleic acid encoding the amino acid sequence set forth 
in SEQ ID NO: 16 (VP2), and a nucleic acid encoding the amino acid sequence set forth in 
SEQ ED NO: 18 (VP3). The present invention also specifically provides a nucleic acid 

25 comprising SEQ ID NO:5 (VP1 gene); a nucleic acid comprising SEQ ID NO: 17 (VP2 

gene); and a nucleic acid comprising SEQ ID NO: 19 (VP3 gene). The present invention also 
specifically provides a nucleic acid consisting essentially of SEQ ID NO:5 (VP1 gene), a 
nucleic acid consisting essentially of SEQ ID NO: 17 (VP2 gene), and a nucleic acid 
consisting essentially of SEQ ID NO: 19 (VP3 gene). Furthermore, a nucleic acid encoding 

30 an AAV4 capsid protein VP1 is set forth as nucleotides 2260-4467 of SEQ ID NO:l; a 

* nucleic acid encoding an AAV4 capsid protein VP2 is set forth as nucleotides 2668-4467 of 
SEQ ID NO:l; and a nucleic acid encoding an AAV4 capsid protein VP3 is set forth as 
nucleotides 2848-4467 of SEQ ID NO:l. Minor modifications in the nucleotide sequences 
encoding the capsid, or coat, proteins are contemplated, as described above for other AAV4 

35 nucleic acids. 
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5 Provided is an isolated AAV4 Rep protein. AAV4 Rep polypeptide is encoded by 

ORF1 of AAV4. Specifically, provided is an AAV4 Rep polypeptide comprising the amino 
acid sequence set forth in SEQ ED NO:2, or a unique fragment thereof. The present 
invention also provides an AAV 4 Rep polypeptide consisting essentially of the amino acid 
sequence set forth in SEQ ED NO:2, or a unique fragment thereof. Additionally, nucleotides 

10 29 1 -2306 of the AAV4 genome, which genome is set forth in SEQ ID NO: 1 , encode the 

AAV4 Rep polypeptide. The present invention also provides each AAV4 Rep protein. Thus 
provided is AAV4 Rep 40, or a unique fragment thereof. The present invention particularly 
provides Rep 40 having the amino acid sequence set forth in SEQ ID NO:8. Provided is 
AAV 4 Rep 52, or a unique fragment thereof. The present invention particularly provides 

1 5 Rep 52 having the amino acid sequence set forth in SEQ ID NO:9. Provided is AAV4 Rep 
68, or a unique fragment thereof. The present invention particularly provides Rep 68 having 
the amino acid sequence set forth in SEQ ID NO: 10. Provided is AAV4 Rep 78, or a unique 
fragment thereof The present invention particularly provides Rep 78 having the amino acid 
sequence set forth in SEQ ID NO:l 1. By "unique fragment thereof 1 is meant any smaller 

20 polypeptide fragment encoded by AAV rep gene that is of sufficient length to be unique to 
the Rep polypeptide. Substitutions and modifications of the amino acid sequence can be 
made as described above and, further, can include protein processing modifications, such as 
glycosylation, to the polypeptide. However, a polypeptide including all four Rep proteins 
will encode a polypeptide having at least about 91% overall homology to the sequence set 

25 forth in SEQ ID NO:2, and it can have about 93%, about 95%, about 98%, about 99% or 
100% homology with the amino acid sequence set forth in SEQ ID NO:2. 

The present invention further provides an AAV4 Capsid polypeptide or a unique 
fragment thereof AAV4 capsid polypeptide is encoded by ORF 2 of AAV4. Specifically, 
provided is an AAV4 Capsid protein comprising the amino acid sequence encoded by 

30 nucleotides 2260-4467_of the nucleotide sequence set forth in SEQ ID NO: 1, or a unique 
fragment of such protein. The present invention also provides an AAV4 Capsid protein 
consisting essentially of the amino acid sequence encoded by nucleotides 2260-4467 of the 
nucleotide sequence set forth in SEQ ID NO:l, or a unique fragment of such protein. The 
present invention further provides the individual AAV4 coat proteins, VP1, VP2 and VP3. 

35 Thus, provided is an isolated polypeptide having the amino acid sequence set forth in SEQ 
ID NO:4 (VP1). The present invention additionally provides an isolated polypeptide having 
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5 the amino acid sequence set forth in SEQ ID NO: 16 (VP2). The present invention also 

provides an isolated polypeptide having the amino acid sequence set forth in SEQ ID NO: 18 
(VP3). By "unique fragment thereof 1 is meant any smaller polypeptide fragment encoded by 
any AAV4 capsid gene that is of sufficient length to be unique to the AAV4 Capsid protein. 
Substitutions and modifications of the amino acid sequence can be made as described above 

1 0 and, further, can include protein processing modifications, such as glycosylation, to the 
polypeptide. However, an AAV4 Capsid polypeptide including all three coat proteins will 
have at least about 63% overall homology to the polypeptide encoded by nucleotides 2260- 
4467 of the sequence set forth in SEQ ID NO: 1. The protein can have about 65%, about 
70%, about 75%, about 80%, about 85%, about 90%, about 95% or even 100% homology to 

15 the amino acid sequence encoded by the nucleotides 2260-4467 of the sequence set forth in 
SEQ ID NO:4. An AAV4 VP2 polypeptide can have at least about 58%, about 60%, about 
70%, about 80%, about 90% about 95% or about 100% homology to the amino acid 
sequence set forth in SEQ ID NO: 16. An AAV4 VP3 polypeptide can have at least about 
60%, about 70%, about 80%, about 90% about 95% or about 100% homology to the amino 

20 acid sequence set forth in SEQ ID NO: 1 8. 

The AAV inverted terminal repeats in the vector for the herein described delivery 
methods can be AAV4 inverted terminal repeats. Specifically, they can comprise the nucleic 
acid whose nucleotide sequence is set forth in SEQ ID NO:6 or the nucleic acid whose 
nucleotide sequence is set forth in SEQ ID NO:20, or any fragment thereof demonstrated to 

25 have ITR functioning. The ITRs can also consist essentially of the nucleic acid whose x 
nucleotide sequence is set forth in SEQ ED NO:6 or the nucleic acid whose nucleotide 
sequence is set forth in SEQ ID NO:20. Furthermore, the AAV inverted terminal repeats in 
the vector for the herein described nucleic acid delivery methods can also comprise AAV2 
inverted terminal repeats. Additionally, the AAV inverted terminal repeats in the vector for 

30 this delivery method can also consist essentially of AAV2 inverted terminal repeats. 

Compositions and methods for making AAV5 vectors 

Compositions and methods for making and using AAV5 vectors have been 
previously described in U.S. Patent Application No. 09/533427, filed March 22, 2000, 
35 incorporated herein by reference for this teaching. 
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5 The present application provides a recombinant adeno-associated virus 5 (AAV5). 

This virus has one or more of the characteristics described below. The compositions of the 
present invention do not include wild-type AAV5. The methods of the present invention can 
use either wild-type AAV5 or recombinant AAV5-based delivery. 

Provided are novel AAV 5 particles, recombinant AAV5 vectors, recombinant AAV5 

10 virions and novel AAV5 nucleic acids and polypeptides. An AAV5 particle is a viral 

particle comprising an AAV5 capsid protein. A recombinant AAV5 vector is a nucleic acid 
construct that comprises at least one unique nucleic acid of AAV5. A recombinant AAV5 
virion is a particle containing a recombinant AAV5 vector, wherin the particle can be either 
an AAV5 particle as described herein or a non-AAV5 particle. Alternatively, the 

1 5 recombinant AAV 5 virion is an AAV5 particle containing a recombinant vector, wherein 
the vector can be either an AAV5 vector as described herein or a non-AAV5 vector. These 
vectors, particles, virions, nucleic acids and polypeptides are described below. 

Provided is the nucleotide sequence of the AAV5 genome and vectors and particles 
derived therefrom. Specifically, provided is a nucleic acid vector comprising a pair of 

20 AAV 5 inverted terminal repeats (ITRs) and a promoter between the inverted terminal 

repeats. While the rep proteins of AAV2 and AAV5 will bind to either a type 2 ITR or a 
type 5 ITR, efficient genome replication only occurs when type 2 Rep replicates a type 2 
ITR and a type 5 Rep replicates a type 5 ITR. This specificity is the result of a difference in 
DNA cleavage specificity of the two Reps which is necessary for replication. AAV5 Rep 

25 cleaves at CGGT A GTG A (SEQ ID NO: 43) and AAV2 Rep cleaves at CGGT^TGAG (SEQ 
ID NO: 44) (Chiorini et al., 1999. J. Virol. 73 (5) 4293-4298). Mapping of the AAV5 ITR 
terminal resolution site (TRS) identified this distinct cleavage site, CGGT A GTGA, which is 
absent from the ITRs of other AAV serotypes. Therefore, the minimum sequence necessary 
to distinguish AAV5 from AAV2 is the TRS site where Rep cleaves in order to replicate the 

30 virus. Examples of the type 5 ITRs are shown in SEQ ID NO: 41 and SEQ ID NO: 42, 

AAV5 ITR "flip" and AAV5_liflop'\ respectively. Minor modifications in an ITR of either 
orientation are contemplated -sMxcLare thr*** that will not interfere with the hairpin structure 
formed by the AAV5 ITR as described herein. Furthermore, to be considered within the 
term "AAV5 ITR" the nucieotide-sequenT;enmast retain one or more features described 

35 herein that distinguish the AAV5 ITR from the ITRs of other serotypes, e.g. it must retain 
the Rep binding site described herein. 
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5 The D- region of the AAV5 ITR (SEQ ID NO: 45), a single stranded region of the 

ITR, inboard of the TRS site, has been shown to bind a factor which depending on its 
phosphorylation state correlates with the conversion of the AAV from a single stranded 
genome to a transcriptionally active form that allows for expression of the viral DNA. This 
region is conserved between AAV2, 3, 4,and 6 but is divergent in AAV5. The D+ region is 

10 the reverse complement of the D- region. 

The promoter can be any desired promoter, selected by known considerations, such 
as the level of expression of a nucleic acid functionally linked to the promoter and the cell 
type in which the vector is to be used. That is, the promoter can be tissue/cell-specific. 
Promoters can be prokaryotic, eukaryotic, fungal, nuclear, mitochondrial, viral or plant 

15 promoters. Promoters can be exogenous or endogenous to the cell type being transduced by 
the vector. Promoters can include, for example, bacterial promoters, known strong 
promoters such as S V40 or the inducible metallothionein promoter, or an AAV promoter, 
such as an AAV p5 promoter. Additionally, chimeric regulatory promoters for targeted gene 
expression can be utilized. Examples of these regulatory systems, which are known in the 

20 art, include the tetracycline based regulatory system which utilizes the tet transactivator 
protein (tTA), a chimeric protein containing the VP 16 activation domain fused to the tet 
repressor of Escherichia coli, the IPTG based regulatory system, the CDD based regulatory 
system, and the Ecdysone based regulatory system. Other promoters include promoters 
derived from actin genes, immunoglobulin genes, cytomegalovirus (CMV), adenovirus, 

25 bovine papilloma virus, adenoviral promoters, such as the adenoviral major late promoter, 
an inducible heat shock promoter, respiratory syncytial virus, Rous sarcomas virus (RSV), 
etc., specifically, the promoter can be AAV2 p5 promoter or AAV5 p5 promoter. More 
specifically, the AAV5 p5 promoter can be about same location in SEQ ED NO: 23 as the 
AAV2 p5 promoter, in the corresponding AAV2 published sequence. An example of an 

30 AAV5 p5 promoter is nucleotides 220-338 of SEQ ED NO: 23. Additionally, the p5 

promoter may be enhanced by nucleotides 1-130 of SEQ ID NO: 23. Furthermore, smaller 
fragments of p5 promoter that retain promoter activity can readily be determined by standard 
procedures including, for example, constructing a series of deletions in the p5 promoter, 
linking the deletion to a reporter gene, and determining whether the reporter gene is 

35 expressed, i.e., transcribed and/or translated. The promoter can be the promoter of any of the 
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5 AAV serotypes, and can be the pl9 promoter (SEQ ID NO: 38) or the p40 promoter set 
forth in the sequence listing as SEQ ID NO: 39. 

It should be recognized that any errors in any of the nucleotide sequences disclosed 
herein can be corrected, for example, by using the hybridization procedure described below 
with various probes derived from the described sequences such that the coding sequence can 

10 be reisolated and resequenced. Rapid screening for point mutations can also be achieved 
with the use of polymerase chain reaction single strand conformation polymorphism (PCR 
SSCP). The corresponding amino acid sequence can then be corrected accordingly. 

The AAV5-derived vector can include any normally occurring AAV5 sequences in 
addition to an ITR and promoter. Examples of vector constructs are provided below. 

1 5 The present vector or AAV5 particle or recombinant AA V5 virion can utilize any 

unique fragment of the present AAV5 nucleic acids, including the AAV5 nucleic acids set 
forth in SEQ ID NOS: 23 and 29-33, 35, 37, 38, 39 and 40. To be unique, the fragment must 
be of sufficient size to distinguish it from other known sequences, most readily determined 
by comparing any nucleic acid fragment to the nucleotide sequences of nucleic acids in 

20 computer databases, such as GenBank. Such comparative searches are standard in the art. 
Typically, a unique fragment useful as a primer or probe will be at least about 8 or 10, 
preferable at least 20 or 25 nucleotides in length, depending upon the specific nucleotide 
content of the sequence. Additionally, fragments can be, for example, at least about 30, 40, 
50, 75, 100, 200 or 500 nucleotides in length and can encode polypeptides or be probes. The 

25 nucleic acid can be single or double stranded, depending upon the purpose for which it is 
intended. Where desired, the nucleic acid can be RNA. 

The present invention further provides an isolated AAV5 capsid protein to contain 
the vector. In particular, provided is not only a polypeptide comprising all three AAV5 coat 
proteins, i.e., VP1, VP2 and VP3, but also a polypeptide comprising each AAV5 coat 

30 protein individually, SEQ ID NOS: 26, 27, and 28, respectively. Thus an AAV5 particle 

comprising an AAV5 capsid-protein comprises aUeast one AAV 5 coat protein VP1, VP2 or 
VP3. An AAV5 particle comprising an AAV 5 capsid protein can be utilized to deliver a 
nucleic acid vector to a cell, tissue or subject. For example, the herein described AAV5 
vectors can be encapsidated in an AAV5 capsid-derived particle and utilized in a gene 

35 delivery method. Furthermore, other viral nucleic acids can be encapsidated in the AAV5 
particle and utilized in such delivery methods. For example, an AAV1, 2,3,4,or 6 vector 
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5 (e.g. AAV 1 ,2,3,4,or 6 ITR and nucleic acid of interest )can be encapsidated in an AAV5 

particle and administered. Furthermore, an AAV5 chimeric capsid.incorporating both AAV2 
capsid and AAV5 capsid sequences can be generated, by standard cloning methods, 
selecting regions from the known sequences of each protein as desired. For example, 
particularly antigenic regions of the AAV2 capsid protein can be replaced with the 

10 corresponding region of the AAV5 capsid protein. In addition to chimeric capsids 

incorporating AAV2 capsid sequences, chimeric capsids incorporating AAV1, 3, 4, or 6 and 
AAV 5 capsid sequences can be generated, by standard cloning methods, selecting regions 
from the known sequences of each protein as desired. The particle can also comprise only 
VP1 and VP3 capsid proteins. 

15 The capsids can also be modified to alter their specific tropism by genetically 

altering the capsid to encode a specific ligand to a cell surface receptor. Alternatively, the 
capsid can be chemically modified by conjugating a ligand to a cell surface receptor. By 
genetically or chemically altering the capsids, the tropism can be modified to direct AAV5 
to a particular cell or population of cells. The capsids can also be altered immunologically 

20 by conjugating the capsid to an antibody that recognizes a specific protein on the target cell 
or population of cells. 

The capsids can also be assembled into empty particles by expression in mammalian, 
bacterial, fungal or insect cells. For example, AAV2 particles are known to be made from 
VP3 and VP2 capsid proteins in baculovirus. The same basic protocol can produce an empty 

25 AAV5 particle comprising an AAV5 capsid protein. 

The herein described recombinant AAV5 nucleic acid derived vector can be 
encapsidated in an AAV particle. In particular, it can be encapsidated in an AAV1 particle, 
an AAV2 particle, an AAV3 particle, an AAV4 particle, an AAV5 particle or an AAV6 
particle, a portion of any of these capsids, or a chimeric capsid particle as described above, 

30 by standard methods using the appropriate capsid proteins in the encapsidation process, as 
long as the nucleic acid vector fits within the size limitation of the particle utilized. The 
encapsidation process itself is standard in the art. The AAV5 replication machinery, i.e. the 
rep initiator proteins and other functions required for replication, can be utilized to produce 
the AAV5 genome that can be packaged in an AAV1, 2, 3, 4, 5 or 6 capsid. 

35 The recombinant AAV5 virion containing a vector can also be produced by 

recombinant methods utilizing multiple plasmids. In one example, the AAV5 rep nucleic 
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5 acid would be cloned into one plasmid, the AAV5 ITR nucleic acid would be cloned into 
another plasmid and the AAV1, 2, 3, 4, 5 or 6 capsid nucleic acid would be cloned on 
another plasmid. These plasmids would then be introduced into cells. The cells that were 
efficiently transduced by all three plasmids, would exhibit specific integration as well as the 
ability to produce recombinant AAV5 virion. Additionally, two plasmids could be used 

10 where the AAV5 rep nucleic acid would be cloned into one plasmid and the AAV5 ITR and 
AAV5 capsid would be cloned into another plasmid. These plasmids would then be 
introduced into cells. The cells that were efficiently transduced by both plasmids, would 
exhibit specific integration as well as the ability to produce recombinant AAV5 virion. 

An AAV5 capsid polypeptide encoding the entire VP1, VP2, and VP3 polypeptide 

1 5 can have greater than 56% overall homology to the polypeptide having the amino acid 

sequence encoded by nucleotides in SEQ ID NOS: 29, 30, 31, as shown in figures 4 and 5. 
The capsid protein can have about 70% homology, about 75% homology, 80% homology, 
85% homology, 90% homology, 95% homology, 98% homology, 99% homology, or even 
100% homology to the protein having the amino acid sequence encoded by the nucleotides 

20 set forth in SEQ ID NOS: 29, 30, or 3 1 . The percent homology used to identify proteins 

herein, can be based on a nucleotide-by-nucleotide comparison or more preferable is based, 
on a computerized algorithm as described herein. Variations in the amino acid sequence of 
the AAV5 capsid protein are contemplated herein, as long as the resulting particle 
comprising an AAV5 capsid protein remains antigenically or immunologically distinct from 

25 AAV1, AAV2, AAV3, AAV4 or AAV6 capsid, as can be routinely determined by standard 
methods. Specifically, for example, ELISA and Western blots can be used to determine 
whether a viral particle is antigenically or immunologically distinct from AAV2 or the other 
serotypes. Furthermore, the AAV5 particle preferably retains tissue tropism distinction 
from AAV2, such as that exemplified in the examples herein. An AAV5 chimeric particle 

30 comprising at least one AAV5 coat protein may have a different tissue tropism from that of 
an AAV5 particle consisting only of AAV5 coat proteins, but is still distinct from the 
tropism of an AAV2 particle, in that it will infect some cells not infected by AAV2 or an 
AAV 2 particle. 

The invention further provides a recombinant AAV5 virion, comprising an AAV5 
35 particle containing, i.e., encapsidating, a vector comprising a pair of AAV5 inverted 

terminal repeats. The recombinant vector can further comprise an AAV5 Rep-encoding 
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5 nucleic acid. The vector encapsidated in the particle can further comprise an exogenous 
nucleic acid inserted between the inverted terminal repeats. AAV 5 Rep confers targeted 
integration and efficient replication, thus production of recombinant AAV5, comprising 
AAV5 Rep, yields more particles than production of recombinant AAV2. Since AAV5 is 
more efficient at replicating and packaging its genome, the exogenous nucleic acid inserted, 
10 or in the AAV5 capsids of the present invention, between the inverted terminal repeats can 
be packaged in the AAV1 , 2, 3, 4, or 6 capsids to achieve the specific tissue tropism 
conferred by the capsid proteins. 

The invention further contemplates chimeric recombinant ITRs that contains a rep 
binding site and a TRS site recognized by that Rep protein. By "Rep protein" is meant all 
1 5 four of the Rep proteins, Rep 40, Rep 78, Rep 52, Rep 68. Alternatively, "Rep protein" 

could be one or more of the Rep proteins described herein. One example of a chimeric ITR 
would consist of an AAV5 D region (SEQ ED NO: 45), an AAV5 TRS site (SEQ ID NO: 
43), an AAV2 hairpin and an AAV2 binding site. Another example would be an AAV5 D 
region, an AAV5 TRS site, an AAV3 hairpin and an AAV3 binding site. In these chimeric 
20 ITRs, the D region can be from AAV1, 2, 3, 4, 5 or 6. The hairpin can be derived from AAV 
1,2 3, 4, 5, 6. The binding site can be derived from any of AAV1, 2, 3, 4, 5 or 6. Preferably, 
the D region and the TRS are from the same serotype. 

The chimeric ITRs can be combined with AAV5 Rep protein and any of the AAV 
serotype capsids to obtain recombinant virion. For example, recombinant virion can be 
25 produced by an AAV5 D region, an AAV5 TRS site, an AAV2 hairpin, an AAV2 binding 
site, AAV5 Rep protein and AAV1 capsid. This recombinant virion would possess the 
cellular tropism conferred by the AAV1 capsid.protein and would possess the efficient 
replication conferred by the AAV5 Rep. 

Other examples of the ITR, Rep protein and Capsids that will produce recombinant 
30 virion are provided in the list below: 
5ITR + 5Rep + 5Cap=virion 
51TR + 5Rep + lCap=virion 
5ITR + 5Rep + 2Cap=virion 
5ITR + 5Rep + 3Cap=virion 
35 5ITR + 5Rep + 4Cap=virion 

5ITR + 5Rep + 6Cap=virion 
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5 1 ITR + IRep + 5Cap=virion 

2ITR + 2Rep .+ 5Cap=virion 
3ITR + 3Rep + 5Cap=virion 
4ITR + 4Rep + 5Cap=virion 
6ITR + 6Rep + 5Cap=virion 
10 In any of the constructs described herein, inclusion of a promoter is preferred. As 

used in the constructs herein, unless otherwise specified, Cap (capsid) refers to any of 
AAV5 VP1, AAV5 VP2, AAV5 VP3, combinations thereof, functional fragments of any of 
VP1, VP2 or VP3, or chimeric capsids as described herein. The ITRs of the constructs 
described herein, can be chimeric recombinant ITRs as described elsewhere in the 
15 application. 

Conjugates of recombinant or wild-type AAV5 virions and nucleic acids or proteins 
can be used to deliver those molecules to a cell. For example, the purified AAV5 can be 
used as a vehicle for delivering DNA bound to the exterior of the virus. Examples of this are 
to conjugate the DNA to the virion by a bridge using poly L lysine or other charged 

20 molecule. Also contemplated are virosomes that contain AAV5 structural proteins (AAV5 
capsid proteins), lipids such as DOTAP, and nucleic acids that are complexed via charge 
interaction to introduce DNA into cells. 

Also provided by this invention are conjugates that utilize theAAV5 capsid or a 
unique region of the AAV5 capsid protein (e.g. VP1, VP2 or VP3 or combinations thereof) 

25 to introduce DNA into cells. For example, the type 5 VP3 protein or fragment thereof, can 
be conjugated to a DNA on a plasmid that is conjugated to a lipid. Cells can be infected 
using the targeting ability of the VP3 capsid protein to achieve the desired tissue tropism, 
specific to AAV5. Type 5 VP1 and VP2 proteins can also be utilized to introduce DNA or 
other molecules into cells. By further incorporating the Rep protein and the AAV TRS into 

30 the DNA-containing conjugate, cells can be transduced and targeted integration can be 
achieved. For example, if AAV5-specific targeted integration is desired, a conjugate 
composed of the AAV 5 VP'* ^g^^L k^qt a fragment of AAV5 rep, AAV5 TRS, 

the rep binding site, the heterologous DNA of interest, and a lipid, can be utilized to achieve 
AAV 5 specific tropism and~AAV5" specific targeted integration in the genome. 

35 Further provided by this invention are chimeric viruses where AAV5 can be 

combined with herpes virus, herpes virus amplicons, baculovirus or other viruses to achieve 
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5 a desired tropism associated with another virus. For example, the AAV5 ITRs could be 
inserted in the herpes virus and cells could be infected. Post-infection, the ITRs of AAV5 
could be acted on by AAV5 rep provided in the system or in a separate vehicle to rescue 
AAV 5 from the genome. Therefore, the cellular tropism of the herpes simplex virus can be 
combined with AAV5 rep mediated targeted integration. Other viruses that could be utilized 

1 0 to construct chimeric viruses include, lentivirus, retrovirus, pseudotyped retroviral vectors, 
and adenoviral vectors. 

The present invention further provides isolated nucleic acids of AAV5. For example, 
provided is an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 
NO: 23 (AAV5 genome). This nucleic acid, or portions thereof, can be inserted into vectors, 

15 such as plasmids, yeast artificial chromosomes, or other viral vector (particle), if desired, by 
standard cloning methods. The present invention also provides an isolated nucleic acid 
consisting essentially of the nucleotide sequence set forth in SEQ ID NO: 23. The 
nucleotides of SEQ ID NO: 23 can have minor modifications and still be contemplated by 
the present invention. For example, modifications that do not alter the amino acid encoded 

20 by any given codon (such as by modification of the third, "wobble," position in a codon) can 
readily be made, and such alterations are known in the art. Furthermore, modifications that 
cause a resulting neutral (conserved) amino acid substitution of a similar amino acid can be 
made in a coding region of the genome. Additionally, modifications as described herein for 
the AAV 5 components, such as the ITRs, the p5 promoter, etc. are contemplated in this 

25 invention. Furthermore, modifications to regions of SEQ ID NO: 23 other than in the ITR, 
TRS Rep binding site and hairpin are likely to be tolerated without serious impact on the 
function of the nucleic acid as a recombinant vector. 

The present invention additionally provides an isolated nucleic acid that selectively 
hybridizes with any nucleic acid disclosed herein, including the entire AAV5 genome and 

30 any unique fragment thereof, including the Rep and capsid encoding sequences (e.g. SEQ ED 
NOS: 23, 29, 30, 31, 32, 33, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, and 45). Specifically, the 
nucleic acid can selectively or specifically hybridize to an isolated nucleic acid consisting of 
the nucleotide sequence set forth in SEQ ID NO: 23 (AAV5 genome). The present invention 
further provides an isolated nucleic acid that selectively or specifically hybridizes with an 

35 isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID NO: 23 

(AAV5 genome). By "selectively hybridizes" as used herein is meant a nucleic acid that 
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5 hybridizes to one of the disclosed nucleic acids under sufficient stringency conditions 
without significant hybridization to a nucleic acid encoding an unrelated protein, and 
particularly, without detectably hybridizing to nucleic acids of AAV2. Thus, a nucleic acid 
that selectively hybridizes with a nucleic acid of the present invention will not selectively 
hybridize under stringent conditions with a nucleic acid encoding a different protein or the 

10 corresponding protein from a different serotype of the virus, and vice versa. A "specifically 
hybridizing" nucleic acid is one that hybridizes under stringent conditions to only a nucleic 
acid found in AAV5. Therefore, nucleic acids for use, for example, as primers and probes to 
detect or amplify the target nucleic acids are contemplated herein. Nucleic acid fragments 
that selectively hybridize to any given nucleic acid can be used, e.g., as primers and or 

1 5 probes for further hybridization or for amplification methods (e.g., polymerase chain 

reaction (PCR), ligase chain reaction (LCR)). Additionally, for example, a primer or probe 
can be designed that selectively hybridizes with both AAV5 and a gene of interest carried 
within the AAV5 vector (i.e., a chimeric nucleic acid). 

A nucleic acid that selectively hybridizes to any portion of the AAV5 genome is 

20 contemplated herein. Therefore, a nucleic acid that selectively hybridizes to AAV5 can be of 
longer length than the AAV 5 genome, it can be about the same length as the AAV 5 genome 
or it can be shorter than the AAV5 genome. The length of the nucleic acid is limited on the 
shorter end of the size range only by its specificity for hybridization to AAV5, i.e., once it is 
too short, typically less than about 5 to 7 nucleotides in length, it will no longer bind 

25 specifically to AAV5, but rather will hybridize to numerous background nucleic acids. 
Additionally contemplated by this invention is a nucleic acid that has a portion that 
specifically hybridizes to AAV5 and a portion that specifically hybridizes to a gene of 
interest inserted within AAV5. 

The present invention further provides an isolated nucleic acid encoding an adeno- 

30 associated virus 5 Rep protein. The AAV5 Rep proteins are encoded by open reading frame 
(ORF) 1 of the AAV5 genome. Ex amples-of-the-AAV5 Rep genes are shown in the nucleic 
acid set forth in SEQ ID NO: 23u and iiaGfe^»«cleic acids consisting essentially of the 
nucleotide sequences set forth in SEQ ID NOS: 32 (Rep52), 33 (Rep78), 35 (Rep40), and 37 
(Rep68), and nucleic acids comprising the nucleotide sequences set forth in SEQ ID NOS: 

35 32, 33, 35, and 37. However, the present invention contemplates that the Rep nucleic acid 
can include any one, two, three, or four of the four Rep proteins, in any order, in such a 
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5 nucleic acid. Furthermore, minor modifications are contemplated in the nucleic acid, such as 
silent mutations in the coding sequences, mutations that make neutral or conservative 
changes in the encoded amino acid sequence, and mutations in regulatory regions that do not 
disrupt the expression of the gene. Examples of other minor modifications are known in the 
art. Further modifications can be made in the nucleic acid, such as to disrupt or alter 

1 0 expression of one or more of the Rep proteins in order to, for example, determine the effect 
of such a disruption; such as to mutate one or more of the Rep proteins to determine the 
resulting effect, etc. However, in general, a modified nucleic acid encoding a Rep protein 
will have at least about 85%, about 90%, about 93%, about 95%, about 98% or 100% 
homology to the Rep nucleic sequences described herein e.g., SEQ ED NOS: , 11, 13 and 15 

1 5 32, 33, 35 and 37, and the Rep polypeptide encoded therein will have overall about 93%, 
about 95%, about 98%, about 99% or 100% homology with the amino acid sequence 
described herein, e.g., SEQ ED NOS: 24, 25, 34 and 36. Percent homology is determined by 
the techniques described herein. 

The present invention also provides an isolated nucleic acid that selectively or 

20 specifically hybridizes with a nucleic acid consisting essentially of the nucleotide sequence 
set forth in SEQ ID NOS: 32, 33, 35 and 37, and an isolated nucleic acid that selectively 
hybridizes with a nucleic acid comprising the nucleotide sequence set forth in SEQ ID NOS: 
32, 33, 35 and 37. "Selectively hybridizing" and "stringency of hybridization" is defined 
elsewhere herein. 

25 As described above, provided is the nucleic acid encoding a Rep 40 protein and, in 

particular an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 
NO: 35, an isolated nucleic acid consisting essentially of the nucleotide sequence set forth in 
SEQ ID NO: 35, and a nucleic acid encoding the adeno-associated virus 5 protein having the 
amino acid sequence set forth in SEQ ED NO: 34. The present invention also provides the 

30 nucleic acid encoding a Rep 52 protein, and in particular an isolated nucleic acid comprising 
the nucleotide sequence set forth in SEQ ID NO: 32, an isolated nucleic acid consisting 
essentially of the nucleotide sequence set forth in SEQ ID NO: 32, and a nucleic acid 
encoding the adeno-associated virus 5 Rep protein having the amino acid sequence set forth 
in SEQ ID NO: 24. The present invention further provides the nucleic acid encoding a Rep 

35 68 protein and, in particular an isolated nucleic acid comprising the nucleotide sequence set 
forth in SEQ ID NO: 37, an isolated nucleic acid consisting essentially of the nucleotide 
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5 sequence set forth in SEQ ID NO: 37, and a nucleic acid encoding the adeno-associated 
virus 5 protein having the amino acid sequence set forth in SEQ ID NO: 36. And, further, 
provided is the nucleic acid encoding a Rep 78 protein, and in particular an isolated nucleic 
acid comprising the nucleotide sequence set forth in SEQ ID NO: 33, an isolated nucleic 
acid consisting essentially of the nucleotide sequence set forth in SEQ ED NO: 33, and a 

1 0 nucleic acid encoding the adeno-associated virus 5 Rep protein having the amino acid 

sequence set forth in SEQ ID NO: 25. As described elsewhere herein, these nucleic acids 
can have minor modifications, including silent nucleotide substitutions, mutations causing 
conservative amino acid substitutions in the encoded proteins, and mutations in control 
regions that do not or minimally affect the encoded amino acid sequence. 

1 5 The present invention further provides a nucleic acid encoding the entire AAV5 

Capsid polypeptide. Furthermore, provided is a nucleic acid encoding each of the three 
AAV 5 coat proteins, VP1, VP2, and VP3. Thus, provided is a nucleic acid encoding AAV5 
VP1, a nucleic acid encoding AAV 5 VP2, and a nucleic acid encoding AAV5 VP3. Thus, 
provided is a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 26 

20 (VP1); a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 27(VP2), 
and a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 28 (VP3). 
The present invention also specifically provides a nucleic acid comprising SEQ ID NO: 29 
(VP1 gene); a nucleic acid comprising SEQ ID NO: 30 (VP 2 gene); and a nucleic acid 
comprising SEQ ID NO: 31 (VP3 gene). The present invention also specifically provides a 

25 nucleic acid consisting essentially of SEQ ID NO: 29 (VP1 gene), a nucleic acid consisting 
essentially of SEQ ID NO: 30 (VP2 gene), and a nucleic acid consisting essentially of SEQ 
ID NO: 31 (VP3 gene). Minor modifications in the nucleotide sequences encoding the 
capsid, or coat, proteins are contemplated, as described above for other AAV5 nucleic acids. 
However, in general, a modified nucleic acid encoding a capsid protein will have at least 

30 about 85%, about 90%, about 93%, about 95%, about 98% or 100% homology to the capsid 
nucleic sequences described herein e.g., SEQ ID NOS: 29, 30 and 31, and the capsid 
polypeptide encoded therein will have overall about 93%, about 95%, about 98%, about 
99% or 100% homology with the amino acid sequence described herein, e.g., SEQ H> NOS: 
26, 27, and 28. Nucleic acids that selectively hybridize with the nucleic acids of SEQ ID 

35 NOS: 29, 30, and 31 under the conditions described above are also provided. 
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5 Provided is an isolated AAV 5 Rep protein. An AAV5 Rep polypeptide is encoded 

by 0RF1 of AAV5. The present invention also provides each individual AAV5 Rep protein. 
Thus provided is AAV 5 Rep 40 (e.g., SEQ ID NO: 34), or a unique fragment thereof. 
Provided is AAV5 Rep 52 (e.g., SEQ ID NO: 24), or a unique fragment thereof. Provided is 
AAV5 Rep 68 (e.g., SEQ ED NO: 36), or a unique fragment thereof. Provided is an example 

10 of AAV5 Rep 78 (e.g., SEQ ID NO: 25), or a unique fragment thereof. By "unique fragment 
thereof is meant any smaller polypeptide fragment encoded by an AAV5 rep gene that is of 
sufficient length to be found only in the Rep polypeptide. Substitutions and modifications of 
the amino acid sequence can be made as described above and, further, can include protein 
processing modifications, such as glycosylation, to the polypeptide. 

1 5 The present invention further provides an AAV5 Capsid polypeptide or a unique 

fragment thereof. AAV5 capsid polypeptide is encoded by ORF 2 of AAV5. The present 
invention further provides the individual AAV 5 capsid proteins, VP1, VP2 and VP3 or 
unique fragments thereof. Thus, provided is an isolated polypeptide having the amino acid 
sequence set forth in SEQ ID NO: 26 (VP 1). The present invention additionally provides an 

20 isolated polypeptide having the amino acid sequence set forth in SEQ ED NO: 27 (VP2). The 
present invention also provides an isolated polypeptide having the amino acid sequence set 
forth in SEQ ID NO: 28 (VP3). By "unique fragment thereof 1 is meant any smaller 
polypeptide fragment encoded by any AAV 5 capsid gene that is of sufficient length to be 
found only in the AAV5 capsid protein. Substitutions and modifications of the amino acid 

25 sequence can be made as described above and, further, can include protein processing 
modifications, such as glycosylation, to the polypeptide. However, an AAV5 Capsid 
polypeptide including all three coat proteins will have greater than about 56% overall 
1 homology to the polypeptide encoded by the nucleotides set forth in SEQ ID NOS: 26, 27, 

or 28. The protein can have about 65%, about 70%, about 75%, about 80%, about 85%, 

30 about 90%, 93%, 95%, 97% or even 100% homology to the amino acid sequence encoded 
by-the nucleotides set forth in SEQ ID NOS: 26, 27 or 28. An AAV5 VP1 polypeptide can 
baKP at teast about 58%, about 60%, about 70%, about 80%, about 90%, 93%, 95%, 97% or 
about 100% homology to the amino acid sequence set forth in SEQ ID NO: 26. An AAV5 
VP2 polypeptide can have at least about 58%, about 60%, about 70%, about 80%, about 

35 90%, 93%, 95%, 97% or about 100% homology to the amino acid sequence set forth in SEQ 
ID NO: 27. An AAV5 VP 3 polypeptide can have at least about 60%, about 70%, about 
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5 80%, about 90%, 93%, 95%, 97% or about 100% homology to the amino acid sequence set 
forth in SEQ ID NO: 28. 

The AAV ITRs in the vector for the herein described delivery methods can be AAV5 
ITRs (SEQ ID NOS: 41 and 42). Furthermore, the AAV ITRs in the vector for the herein 
described nucleic acid delivery methods can also comprise AAV1, AAV2, AAV3, AAV4, or 
10 AAV6 inverted terminal repeats. 

Compositions and methods for making BAAV vectors 

Compositions and methods for making and using BAAV vectors have been 
previously described in U.S. Patent Application No. 09/533427, incorporated herein by 

1 5 reference for this teaching. 

Provided is a recombinant bovine adeno-associated virus (BAAV). This virus has 
one or more of the characteristics described below. The compositions of the present 
invention do not include wild-type BAAV. The methods of the present invention can use 
either wild-type BAAV or recombinant BAAV-based delivery. 

20 Provided are novel BAAV particles, recombinant BAAV vectors and recombinant 

BAAV virions. An BAAV particle is a viral particle comprising an BAAV capsid protein. A 
recombinant BAAV vector is a nucleic acid construct that comprises at least one unique 
nucleic acid of BAAV. A recombinant BAAV virion is a particle containing a recombinant 
BAAV vector, wherin the particle can be either an BAAV particle as described herein or a 

25 non-BAAV particle. Alternatively, the recombinant BAAV virion is an BAAV particle 
containing a recombinant vector, wherein the vector can be either an BAAV vector as 
described herein or a non-BAAV vector. These vectors, particles, virions, nucleic acids and 
polypeptides are described below. 

Provided is the nucleotide sequence of the BAAV genome and vectors and particles 

30 derived therefrom. Specifically, provided is a nucleic acid vector comprising a pair of 
BAAV inverted terminal repeats (ITRs) and a promoter between the inverted terminal 
repeats. The rep proteins of AAV5 and BAAV will bind to the BAAV ITR and it can 
function as an origin of replication for packaging of recombinant AAV particles. The 
minimum sequence necessary for this activity is the TRS site where Rep cleaves in order to 

35 replicate the virus. Minor modifications in an ITR are contemplated and are those that will 

not interfere with the hairpin structure formed by the ITR as described herein and known in 
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5 the art. Furthermore, to be considered within the term e.g. it must retain the Rep binding site 
described herein. 

The D- region of the AAV2 ITR, a single stranded region of the ITR, inboard of the 
TRS site, has been shown to bind a factor which depending on its phosphorylation state 
correlates with the conversion of the AAV from a single stranded genome to a 

10 transcriptionally active form that allows for expression of the viral DNA. This region is 
conserved between AAV2, 3, 4,and 6 but is divergent in AAV5 and BAAV (SEQ ID NO: 
59). The D+ region is the reverse complement of the D- region. 

The promoter can be any desired promoter, selected by known considerations, such 
as the level of expression of a nucleic acid functionally linked to the promoter and the cell 

1 5 type in which the vector is to be used. That is, the promoter can be tissue/cell-specific. 
Promoters can be prokaryotic, eukaryotic, fungal, nuclear, mitochondrial, viral or plant 
promoters. Promoters can be exogenous or endogenous to the cell type being transduced by 
the vector. Promoters can include, for example, bacterial promoters, known strong 
promoters such as SV40 or the inducible metallothionein promoter, or an AAV promoter, 

20 such as an AAV p5 promoter. Additionally, chimeric regulatory promoters for targeted gene 
expression can be utilized. Examples of these regulatory systems, which are known in the 
art, include the tetracycline based regulatory system which utilizes the tet transactivator 
protein (tTA), a chimeric protein containing the VP 16 activation domain fused to the tet 
repressor of Escherichia coli, the IPTG based regulatory system, the CED based regulatory 

25 system, and the Ecdysone based regulatory system. Other promoters include promoters 
derived from actin genes, immunoglobulin genes, cytomegalovirus (CMV), adenovirus, 
bovine papilloma virus, adenoviral promoters, such as the adenoviral major late promoter, 
an inducible heat shock promoter, respiratory syncytial virus, Rous sarcomas virus (RSV), 
etc., specifically, the promoter can be AAV 2 p5 promoter or AAV 5 p5 promoter or BAAV 

30 p5 promoter. More specifically, the BAAV p5 promoter can be in about the same location in 
SEQ ID NO: 47 as the AAV2 p5 promoter, in the corresponding AAV2 published sequence. 
Additionally, the p5 promoter may be enhanced by nucleotides 1-173 of SEQ ID NO: 47. 
Furthermore, smaller fragments of p5 promoter that retain promoter activity can readily be 
determined by standard procedures including, for example, constructing a series of deletions 

35 in the p5 promoter, linking the deletion to a reporter gene, and determining whether the 
reporter gene is expressed, i.e., transcribed and/or translated. The promoter can be the 
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5 promoter of any of the AAV serotypes, and can be the pi 9 promoter (SEQ ID NO: 62) or 
the p40 promoter set forth in the sequence listing as SEQ ID NO: 63. 

It should be recognized that any errors in any of the nucleotide sequences disclosed 
herein can be corrected, for example, by using the hybridization procedure described below 
with various probes derived from the described sequences such that the coding sequence can 

1 0 be reisolated and resequenced. Rapid screening for point mutations can also be achieved 
with the use of polymerase chain reaction single strand conformation polymorphism (PCR 
SSCP). The corresponding amino acid sequence can then be corrected accordingly. 

The BA AV-derived vector can include any normally occurring BAAV nucleic acid 
sequences in addition to an ITR and promoter. The BAAV-derived vector can also include 

15 sequences that are at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, or 99% identical to 
the BAAV nucleic acids set forth herein. Examples of vector constructs are provided below. 

The present vector or BAAV particle or recombinant BAAV virion can utilize any 
unique fragment of these present BAAV nucleic acids, including the BAAV nucleic acids 
set forth in SEQ ID NOS: 47, 48, 50, 52, 54, 56 and 58-63. To be unique, the fragment must 

20 be of sufficient size to distinguish it from other known sequences, most readily determined 
by comparing any nucleic acid fragment to the nucleotide sequences of nucleic acids in 
computer databases, such as GenBank. Such comparative searches are standard in the art. 
Typically, a unique fragment useful as a primer or probe will be at least about 8 or 10, 
preferable at least 20 or 25 nucleotides in length, depending upon the specific nucleotide 

25 content of the sequence. Additionally, fragments can be, for example, at least about 30, 40, 
50, 75, 100, 200 or 500 nucleotides in length and can encode polypeptides or be probes. The 
nucleic acid can be single or double stranded, depending upon the purpose for which it is 
intended. Where desired, the nucleic acid can be RNA. 

The present invention further provides a BAAV capsid protein to contain the vector. 

30 In particular, provided is not only a polypeptide comprising all three BAAV coat proteins, 
i.e., VP1, VP2 and VP3, butalso a polypeptide comprising each BAAV coat protein 
individually, SEQ ID NOS: 53. 55. and 57 nespectively. Thus, an BAAV particle 
comprising an BAAV capsid protein comprises at least one BAAV coat protein VP1, VP2 
or VP3. A BAAV particle comprising an BAAV capsid protein can be utilized to deliver a 

35 nucleic acid vector to a cell, tissue or subject. For example, the herein described BAAV 
vectors can be encapsidated in an AAV5 capsid-derived particle and utilized in a gene 
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5 delivery method. Furthermore, other viral nucleic acids can be encapsidated in the BAAV 
particle and utilized in such delivery methods. For example, an AAV 1-8 or AAAV vector 
(e.g. AAV 1-8 or AAAV ITR and nucleic acid of interest) can be encapsidated in an BAAV 
particle and administered. Furthermore, a BAAV chimeric capsid incorporating both AAV1- 
8 or AAAV capsid and BAAV capsid sequences can be generated, by standard cloning 

1 0 methods, selecting regions from the known sequences of each protein as desired. For 

example, particularly antigenic regions of the BAAV capsid protein can be replaced with the 
corresponding region of the BAAV capsid protein. In addition to chimeric capsids 
incorporating AAV2 capsid sequences, chimeric capsids incorporating AAV1, 3-8, and 
AAV5 capsid sequences can be generated, by standard cloning methods, selecting regions 

1 5 from the known sequences of each protein as desired. Alternatively a chimeric capsid can be 
made by the addition of a plasmid that expresses AAV 1-8 capsid proteins at a ratio with the 
- BAAV capsid expression plasmid that allows only a few capsid proteins to be incorpated 
into the BAAV particle. Thus, for example, a chimeric particle may be constructed that 
contains 6 AAV2 capsid proteins and 54 BAAV capsid proteins if the complete capsid 

20 contains 60 capsid proteins. 

The capsids can also be modified to alter their specific tropism by genetically 
altering the capsid to encode a specific ligand to a cell surface receptor. Alternatively, the 
capsid can be chemically modified by conjugating a ligand to a cell surface receptor. By 
genetically or chemically altering the capsids, the tropism can be modified to direct BAAV 

25 to a particular cell or population of cells. The capsids can also be altered immunologically 
by conjugating the capsid to an antibody that recognizes a specific protein on the target cell 
or population of cells. 

It has been recently reported that insertion of foreign epitopes (RGD motif, LH 
receptor targeting epitope) in certain regions of AAV2 capsid can redirect viral tropism. 

30 However, AAV2 naturally infects a wide variety of cell types and complete retargeting of 
rAAV2 would be difficult to achieve. Provided are two regions in the capsid of BAAV that 
are on the virus surface and could tolerate substitution. These two regions are aa 257-264 
(GSSNASDT) and aa 444-457 (TTSGGTLNQGNSAT). Other regions of the BAAV capsid 
could also accommodate the substitution of amino acids that would allow for epitope 

35 presentation on the surface of the virus. All of these regions would have in common 1) 
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5 Surface exposure 2) able to support a substitution of sequence to insert the epitope 3) still 
allow for capsid assembly. 

Because of the symmetry of the AAV particles, a substitution in one subunit of the 
capsid will appear multiple times on the capsid surface. For example the capsid is made of 
approximately 55 VP3 proteins. Therefore an epitope incorporated in the VP3 protein could 
10 be expressed 55 times on the surface of each particle increasing the likelihood of the epitope 
forming a stable interaction with its target. In some cases this may be too high of a ligand 
density for functional binding or this high density of epitope may interfere with capsid 
formation. The epitope density could be lowered by introducing another plasrnid into the 
packaging system for production of recombinant particles and the ratio between the 
1 5 packaging plasrnid with the modified VP3 protein and the wt VP3 protein altered to balance 
the epitope density on the virus surface. 

Epitopes could be incorporated into the virus capsid for the purpose of 1) altering the 
tropism of the virus 2) blocking an immune response direct at the virus 3) developing a host 
immune response to the epitope for the purpose of vaccination. 
20 Examples of epitopes that could be added to BAAV capsids include but are not 

limited to: 

LH receptor binding epitope 
RGD integrin binding epitope 
CD 13 binding epitope NGRAHA 
25 The Retanef polyprotein vaccine candidate for HIV-1 

single chain antibody fragments directed against tumor cells 
Endothelial cell binding epitope SIGYPLP 
serpin receptor ligand, KFNKPFVFLI 

protective B-cell epitope hemagglutinin (HA) 91-108 from influenza HA 
30 NDV B-cell immunodominant epitope (IDE) spanning residues 447 to 455 

Major immunogenic epitope for parvovirus B19 ( N1SLDNPLENPSSLFDLVARIK) 
that can elicit protective antibody titers. 

The capsids can also be assembled into empty particles by expression in mammalian, 
bacterial, fungal or insect cells. For example, AAV2 particles are known to be made from 
35 VP3 and VP2 capsid proteins in baculo virus. The same basic protocol can produce an empty 
BAAV particle comprising BAAV capsid proteins and also full particles. 
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5 The herein described recombinant BAAV nucleic acid derived vector can be 

encapsidated in an AAV particle. In particular, it can be encapsidated in an AAV1 particle, 
an AAV2 particle, an AAV3 particle, an AAV4 particle, an AAV5 particle or an AAV6 or 
AAV7 or an AAV 8 or AAAV particle, a portion of any of these capsids, or a chimeric 
capsid particle as described above, by standard methods using the appropriate capsid 

10 proteins in the encapsidation process, as long as the nucleic acid vector fits within the size 
limitation of the particle utilized. The encapsidation process itself is standard in the art. The 
BAAV replication machinery, i.e. the rep initiator proteins and other functions required for 
replication, can be utilized to produce the BAAV genome that can be packaged in an 
AAV 1-8 or AAAV capsid. 

1 5 The recombinant BAAV virion containing a vector can also be produced by 

recombinant methods utilizing multiple plasmids. In one example, the BAAV rep nucleic 
acid would be cloned into one plasmid, the BAAV ITR nucleic acid would be cloned into 
another plasmid and the AAV1-8 capsid nucleic acid would be cloned on another plasmid. 
These plasmids would then be introduced into cells. The cells that were efficiently 

20 transduced by all three plasmids, would exhibit specific integration as well as the ability to 
produce BAAV recombinant virus. Additionally, two plasmids could be used where the 
BAAV rep nucleic acid would be cloned into one plasmid and the BAAV ITR and BAAV 
capsid would be cloned into another plasmid. These plasmids would then be introduced into 
cells. The cells that were efficiently transduced by both plasmids, would exhibit specific 

25 integration as well as the ability to produce BAAV recombinant virus. 

An BAAV capsid polypeptide encoding the entire VP1, VP2, and VP3 polypeptide 
can overall have greater than 56% homology to the polypeptide having the amino acid 
sequence encoded by nucleotides in SEQ ID NOS: 52, 54 and 56. The capsid protein can 
have about 70% homology, about 75% homology, 80% homology, 85% homology, 90% 

30 homology, 95% homology, 98% homology, 99% homology, or even 100% homology to the 
protein having the aminn ar.iH sequence encoded by the nucleotides set forth in SEQ ID 
NjQS- 52. 54 and 56. The percent homology used to identify proteins herein, can be based on 
a nucleotide-by-nucleotide comparison or more preferable is based on a computerized 
algorithm as described herein. Variations in the amino acid sequence of the BAAV capsid 

35 protein are contemplated herein, as long as the resulting particle comprising an BAAV 

capsid protein remains antigenically or immunologically distinct from AAV 1-8 or AAAV 
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5 capsid, as can be routinely determined by standard methods. Specifically, for example, 

ELISA and Western blots can be used to determine whether a viral particle is antigenicity 
or immunologically distinct from AAV2 or the other serotypes. Furthermore, the BAAV 
particle preferably retains tissue tropism distinction from other AAVs, such as that 
exemplified in the examples herein. A BAAV chimeric particle comprising at least one 
1 0 BAAV coat protein may have a different tissue tropism from that of an BAAV particle 
consisting only of BAAV coat proteins, but is still distinct from the tropism of an AAV2 
particle. 

The invention further provides a recombinant BAAV virion, comprising a BAAV 
particle containing, i.e., encapsidating, a vector comprising a pair of BAAV inverted 
1 5 terminal repeats. The recombinant vector can further comprise a BAAV Rep-encoding 
nucleic acid. The vector encapsidated in the particle can further comprise an exogenous 
nucleic acid inserted between the inverted terminal repeats. 

The invention further contemplates chimeric recombinant ITRs that contain a rep 
binding site and a TRS site recognized by that Rep protein. By "Rep protein" is meant all 
20 four of the Rep proteins, Rep 40, Rep 78, Rep 52, Rep 68. Alternatively, "Rep protein" 

could be one or more of the Rep proteins described herein. One example of a chimeric ITR 
would consist of an BAAV D region (SEQ ID NO: 59), an BAAV TRS site (SEQ ID NO: 
60), an AAV2 hairpin and an AAV2 Rep binding site. Another example would be a BAAV 
D region, an BAAV TRS site, an AAV3 hairpin and an AAV3 Rep binding site. In these 
25 chimeric ITRs, the D region can be from AAV1-8 or AAAV. The hairpin can be derived 

from AAV 1-8 or AAAV. The binding site can be derived from any of AAV 1-8 or AAAV. 
Preferably, the D region and the TRS are from the same serotype. 

The chimeric ITRs can be combined with BAAV Rep protein and any of the AAV 
serotype capsids to obtain recombinant virion. For example, recombinant virion can be 
30 produced by a BAAV D region, an BAAV TRS site, an AAV2 hairpin, an AAV2 binding 
site, BAAV Rep protein ancLAAV 1 capsidLJhisjcecomhinant virion would possess the 
cellular tropism conferred bv tb<& AAV'i.^cgggj^protein and would possess the efficient 
replication conferred by the BAAV Rep. 

Other examples of"the-ITR7"Rep"protein"and Capsids that will produce recombinant 
35 virus are provided in the list below but not limited to : 
BAAV ITR + BAAV Rep + BAAV Cap=virus 
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5 AAV5 ITR + BAAV Rep + BAAV Cap=virus 

AAV5 ITR + BAAV Rep + AAV1 Cap=virus 

AAV5 ITR + BAAV Rep + AAV2 Cap=virus 

AAV5 ITR + BAAV Rep + AAV3 Cap=virus 

AAV5 ITR + BAAV Rep + AAV4 Cap=virus 
1 0 AAV5 ITR + BAAV Rep + AAV5 Cap= virus 

AA V5 ITR + BAAV Rep + AAV6 Cap=virus 

AAV5 ITR + BAAV Rep + AAV7 Cap=virus 

AAV5 ITR + BAAV Rep + AAV8 Cap=virus 

BAAV ITR + AAV5 Rep + BAAV Cap=virus 
1 5 BAAV ITR + AAV5 Rep + AAV1 Cap=virus 

BAAV ITR + AAV5 Rep + AAV2 Cap=virus 

BAAV ITR + AAV 5 Rep + AAV3 Cap=virus 

BAAV ITR + AAV5 Rep + AAV4 Cap=virus 

BAAV ITR + AAV5 Rep + AAV5 Cap=virus 
20 BAAV ITR + AAV 5 Rep + AAV6 Cap=virus 

BAAV ITR + AAV5 Rep + AAV7 Cap=virus 

BAAV ITR + AAV5 Rep + AAV8 Cap=virus 

AA V5 ITR + AAV5 Rep + BAAV Cap=virus 

AAV1 ITR + AAV1 Rep + BAAV Cap=virus 
25 AAV2 ITR + AAV2 Rep + BAAV Cap=virus 

AAV3 ITR + AAV3 Rep + BAAV Cap=virus 

AAV4 ITR + AAV4 Rep + BAAV Cap=virus 

AAV5 ITR + AAV5 Rep + BAAV Cap=virus 

AAV6 ITR + AAV6 Rep + BAAV Cap=virus 
30 AAV7 ITR + AAV7 Rep + BAAV Cap=virus 

AAV8 ITR + AAV8 Rep + BAAV Cap=virus 

In any of the constructs described herein, inclusion of a promoter is preferred. As 
used in the constructs herein, unless otherwise specified, Cap (capsid) refers to any of 
35 BAAV VP 1, BAAV VP2, BAAV VP3, combinations thereof, functional fragments of any 
of VP1, VP2 or VP3, or chimeric capsids as described herein. The ITRs of the constructs 
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5 described herein, can be chimeric recombinant ITRs as described elsewhere in the 
application. 

Conjugates of recombinant or wild-type BAAV virions and nucleic acids or proteins 
can be used to deliver those molecules to a cell. For example, the purified BAAV can be 
used as a vehicle for delivering DNA bound to the exterior of the virus. Examples of this are 

1 0 to conjugate the DNA to the virion by a bridge using poly L lysine or other charged 

molecule. Also contemplated are virosomes that contain BAAV structural proteins (BAAV 
capsid proteins), lipids such as DOTAP, and nucleic acids that are complexed via charge 
interaction to introduce DNA into cells. 

Also provided by this invention are conjugates that utilize the BAAV capsid or a 

15 unique region of the BAAV capsid protein (e.g. VP1, VP2 or VP3 or combinations thereof) 
to introduce DNA into cells. For example, the BAAV VP3 protein or fragment thereof, can 
be conjugated to a DNA on a plasmid that is conjugated to a lipid. Cells can be infected 
using the targeting ability of the VP3 capsid protein to achieve the desired tissue tropism, 
specific to BAAV. BAAV VP1 and VP2 proteins can also be utilized to introduce DNA or 

20 other molecules into cells. By further incorporating the Rep protein and the AAV TRS into 
the DNA-containing conjugate, cells can be transduced and targeted integration can be 
achieved. For example, if BAAV specific targeted integration is desired, a conjugate 
composed of the BAAV VP3 capsid, BAAV rep or a fragment of BAAV rep, BAAV TRS, 
the rep binding site, the heterologous DNA of interest, and a lipid, can be utilized to achieve 

25 BAAV specific tropism and BAAV specific targeted integration in the genome. 

Further provided by this invention are chimeric viruses where BAAV can be 
combined with herpes virus, baculovirus or other viruses to achieve a desired tropism 
associated with another virus. For example, the BAAV ITRs could be inserted in the herpes 
virus and cells could be infected. Post-infection, the ITRs of BAAV could be acted on by 

30 BAAV rep provided in the system or in a separate vehicle to rescue BAAV from the 

genome. Therefore, the cellular tropism of the herpes simplex virus can be combined with 
BAAV rep mediated targeted iiitegratiap Otfeer viruses that could be utilized to construct 
chimeric viruses include, lentivirus, retrovirus, pseudotyped retroviral vectors, and 
adenoviral vectors. 

35 The present invention further provides isolated nucleic acids of BAAV. For 

example, provided is an isolated nucleic acid comprising the nucleotide sequence set forth 
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5 in SEQ ID NO: 47 (BAAV genome). This nucleic acid, or portions thereof, can be inserted 
into vectors, such as plasmids, yeast artificial chromosomes, or other viral vector (particle), 
if desired, by standard cloning methods. The present invention also provides an isolated 
nucleic acid consisting essentially of the nucleotide sequence set forth in SEQ ID NO: 47. 
The nucleotides of SEQ ID NO: 47 can have minor modifications and still be contemplated 

10 by the present invention. For example, modifications that do not alter the amino acid 

encoded by any given codon (such as by modification of the third, "wobble," position in a 
codon) can readily be made, and such alterations are known in the art. Furthermore, 
modifications that cause a resulting neutral (conserved) amino acid substitution of a similar 
amino acid can be made in a coding region of the genome. Additionally, modifications as 

1 5 described herein for the BAAV components, such as the ITRs, the p5 promoter, etc. are 

contemplated in this invention. Furthermore, modifications to regions of SEQ ID NO:-!- 47 
other than in the ITR, TRS, Rep binding site and hairpin are likely to be tolerated without 
serious impact on the function of the nucleic acid as a recombinant vector. 

The present invention additionally provides an isolated nucleic acid that selectively 

20 hybridizes with any nucleic acid disclosed herein, including the entire BAAV genome and 

any unique fragment thereof, including the Rep and capsid encoding sequences (e.g. SEQ ED 
NOS: 47, 48, 50, 52, 54, 56, 58, 59, 60, 61, 62, 63). Specifically, the nucleic acid can 
selectively or specifically hybridize to an isolated nucleic acid consisting of the nucleotide 
sequence set forth in SEQ ID NO: 47 (BAAV genome). The present invention further 

25 provides an isolated nucleic acid that selectively or specifically hybridizes with an isolated 
nucleic acid comprising the nucleotide sequence set forth in SEQ ID NO: 47 (BAAV 
genome). By "selectively hybridizes" as used herein is meant a nucleic acid that hybridizes 
to one of the disclosed nucleic acids under sufficient stringency conditions without 
significant hybridization to a nucleic acid encoding an unrelated protein, and particularly, 

30 without detectably hybridizing to nucleic acids of AAV2. Thus, a nucleic acid that 

selectively hybridizes with a nucleic acid of the present invention will not selectively 
hybridize under stringent conditions with a nucleic acid encoding a different protein or the 
corresponding protein from a different serotype of the virus, and vice versa. A "specifically 
hybridizing" nucleic acid is one that hybridizes under stringent conditions to only a nucleic 

35 acid found in BAAV. Therefore, nucleic acids for use, for example, as primers and probes to 
detect or amplify the target nucleic acids are contemplated herein. Nucleic acid fragments 

53 

233334_I7.DOC 



ATTORNEY DOCKET NO- 14014.0427U1 



5 that selectively hybridize to any given nucleic acid can be used, e.g., as primers and or 
probes for further hybridization or for amplification methods (e.g., polymerase chain 
reaction (PCR), ligase chain reaction (LCR)). Additionally, for example, a primer or probe 
can be designed that selectively hybridizes with both BAAV and a gene of interest carried 
within the BAAV vector (i.e., a chimeric nucleic acid). 

10 A nucleic acid that selectively hybridizes to any portion of the BAAV genome is 

contemplated herein. Therefore, a nucleic acid that selectively hybridizes to BAAV can be 
of longer length than the BAAV genome, it can be about the same length as the BAAV 
genome or it can be shorter than the BAAV genome. The length of the nucleic acid is 
limited on the shorter end of the size range only by its specificity for hybridization to 

1 5 BAAV, i.e., once it is too short, typically less than about 5 to 7 nucleotides in length, it will 
no longer bind specifically to BAAV, but rather will hybridize to numerous background 
nucleic acids. Additionally contemplated by this invention is a nucleic acid that has a 
portion that specifically hybridizes to BAAV and a portion that specifically hybridizes to a 
gene of interest inserted within BAAV. 

20 The present invention further provides an isolated nucleic acid encoding a bovine 

adeno-associated virus Rep protein. The BAAV Rep proteins are encoded by open reading 
frame (ORF) 1 of the BAAV genome. Examples of the BAAV Rep genes are shown in the 
nucleic acid set forth in SEQ ID NO: 47, and include nucleic acids consisting essentially of 
the nucleotide sequences set forth in SEQ ID NOS: 48 (rep78), 4(rep52) and nucleic acids 

25 comprising the nucleotide sequences set forth in SEQ ID NOS: 48 and 50. However, the 

present invention contemplates that the Rep nucleic acid can include any one, two, three, or 
four of the four Rep proteins, in any order, in such a nucleic acid. Furthermore, minor 
modifications are contemplated in the nucleic acid, such as silent mutations in the coding 
sequences, mutations that make neutral or conservative changes in the encoded amino acid 

30 sequence, and mutations in regulatory regions that do not disrupt the expression of the gene. 
Examples of other minor modifications are known in the art. Further modifications can be 
made in the nucleic acid, such as to disrupt or alter expression of one or more of the Rep 
proteins in order to, for example, determine the effect of such a disruption; such as to mutate 
one or more of the Rep proteins to determine the resulting effect, etc. However, in general, a 

35 modified nucleic acid encoding a Rep protein will have at least about 85%, about 90%, 
about 93%, about 95%, about 98% or 100% homology to the Rep nucleic sequences 
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5 described herein e.g., SEQ ID NOS: 48 and 50, and the Rep polypeptide encoded therein 
will have overall about 93%, about 95%, about 98%, about 99% or 100% homology with 
the amino acid sequence described herein, e.g., SEQ ID NOS: 49 and 51. Percent homology 
is determined by the techniques described herein. 

The present invention also provides an isolated nucleic acid that selectively or 

1 0 specifically hybridizes with a nucleic acid consisting essentially of the nucleotide sequence 
set forth in SEQ ID NOS: 48 and 50, and an isolated nucleic acid that selectively hybridizes 
with a nucleic acid comprising the nucleotide sequence set forth in SEQ ID NOS: 48 and 50. 
"Selectively hybridizing" and "stringency of hybridization" is defined elsewhere herein. 

As described above, provided is the nucleic acid encoding a Rep 78 protein and, in 

15 particular an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 
NO: 48, an isolated nucleic acid consisting essentially of the nucleotide sequence set forth in 
SEQ ID NO: 48, and a nucleic acid encoding the bovine adeno-associated virus protein 
having the amino acid sequence set forth in SEQ ID NO: 49. The present invention also 
provides the nucleic acid encoding a Rep 52 protein, and in particular an isolated nucleic 

20 acid comprising the nucleotide sequence set forth in SEQ ID NO: 50, an isolated nucleic 
acid consisting essentially of the nucleotide sequence set forth in SEQ ID NO: 50, and a 
nucleic acid encoding the bovine adeno-associated virus Rep 52 protein having the amino 
acid sequence set forth in SEQ ID NO: 51. As described elsewhere herein, these nucleic 
acids can have minor modifications, including silent nucleotide substitutions, mutations 

25 causing conservative amino acid substitutions in the encoded proteins, and mutations in 
control regions that do not or minimally affect the encoded amino acid sequence. 

The present invention further provides a nucleic acid encoding the entire BAAV 
Capsid polypeptide. Furthermore, provided is a nucleic acid encoding each of the three 
BAAV coat proteins, VP1, VP2, and VP3. Thus, provided is a nucleic acid encoding BAAV 

30 VP1, a nucleic acid encoding BAAV VP2, and a nucleic acid encoding BAAV VP3. Thus, 
provided is a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 53 
rypiv a mirfMoi^ acid encoding the amino acid sequence set forth in SEQ ID NO: 55 (VP2), 
and a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 57 (VP3). 
"The present invention also specifically provides a nucleic acid comprising SEQ ID NO: 52 

35 (VP I gene); a nucleic acid comprising SEQ ID NO: 54 (VP2 gene); and a nucleic acid 

comprising SEQ ID NO: 56 (VP3 gene). The present invention also specifically provides a 
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5 nucleic acid consisting essentially of SEQ ID NO: 52 (VP1 gene), a nucleic acid consisting 
essentially of SEQ ID NO: 54 (VP2 gene), and a nucleic acid consisting essentially of SEQ 
rD NO: 56 (VP 3 gene). Minor modifications in the nucleotide sequences encoding the 
capsid, or coat, proteins are contemplated, as described above for other BAAV nucleic 
acids. However, in general, a modified nucleic acid encoding a capsid protein will have at 

10 least about 85%, about 90%, about 93%, about 95%, about 98% or 100% homology to the 
capsid nucleic sequences described herein e.g., SEQ ID NOS: 52, 54 and 56, and the capsid 
polypeptide encoded therein will have overall about 93%, about 95%, about 98%, about 
99% or 100% homology with the amino acid sequence described herein, e.g., SEQ ID NOS: 
53, 55 and 57. Nucleic acids that selectively hybridize with the nucleic acids of SEQ ID 

1 5 NOS: 52, 54 and 56 under the conditions described above are also provided. 

Provided is an isolated BAAV Rep protein. An BAAV Rep polypeptide is encoded 
by ORF1 of BAAV. The present invention also provides each individual BAAV Rep 
protein. Thus provided is BAAV Rep 52 (e.g., SEQ ID NO: 50), or a unique fragment 
thereof. Provided is BAAV Rep 78 (e.g., SEQ ID NO: 48), or a unique fragment thereof. By 

20 "unique fragment thereof is meant any smaller polypeptide fragment encoded by an BAAV 
rep gene that is of sufficient length to be found only in the Rep polypeptide. Substitutions 
and modifications of the amino acid sequence can be made as described above and, further, 
can include protein processing modifications, such as glycosylation, to the polypeptide. 
The present invention further provides a BAAV Capsid polypeptide or a unique 

25 fragment thereof. BAAV capsid polypeptide is encoded by ORF 2 of BAAV. The present 
invention further provides the individual BAAV capsid proteins, VP1, VP2 and VP 3 or 
unique fragments thereof. Thus, provided is an isolated polypeptide having the amino acid 
sequence set forth in SEQ ED NO:52 (VP1). The present invention additionally provides an 
isolated polypeptide having the amino acid sequence set forth in SEQ ID NO: 54 (VP2). The 

30 present invention also provides an isolated polypeptide having the amino acid sequence set 
forth in SEQ ID NO:56 (VE3.)_By "unique fragment thereof 1 is meant any smaller 
polypeptide fragment encoded by aiav RAAv c^osid gene that is of sufficient length to be 
found only in the BAAV capsid protein. Substitutions and modifications of the amino acid 
sequence can be made as described above and, further, can include protein processing 

35 modifications, such as glycosylation, to the polypeptide. However, an BAAV Capsid 
polypeptide including all three coat proteins will have greater than about 56% overall 
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5 homology to the polypeptide encoded by the nucleotides set forth in SEQ ID NOS: 52, 54 or 
56. The protein can have about 65%, about 70%, about 75%, about 80%, about 85%, about 
90%, 93%, 95%, 97% or even 100% homology to the amino acid sequence encoded by the 
nucleotides set forth in SEQ ID NOS: 52, 54 or 56. An BAAV VP1 polypeptide can have at 
least about 58%, about 60%, about 70%, about 80%, about 90%, 93%, 95%, 97% or about 

10 100% homology to the amino acid sequence set forth in SEQ ID NO: 53. An BAAV VP2 
polypeptide can have at least about 58%, about 60%, about 70%, about 80%, about 90%, 
93%, 95%, 97% or about 100% homology to the amino acid sequence set forth in SEQ ID 
NO: 55. An BAAV VP 3 polypeptide can have at least about 60%, about 70%, about 80%, 
about 90%, 93%, 95%, 97% or about 100% homology to the amino acid sequence set forth 

15 in SEQ ED NO: 57. 

The present invention also provides a method of producing the BAAV virus by 
transducing a cell with the nucleic acid encoding the virus. 

The present method further provides a method of delivering an exogenous 
(heterologous) nucleic acid to a cell comprising administering to the cell an BAAV particle 

20 containing a vector comprising the nucleic acid inserted between a pair of AAV inverted 
terminal repeats, thereby delivering the nucleic acid to the cell. 

The AAV ITRs in the vector for the herein described delivery methods can be AAV 
ITRs (SEQ ID NOS: 58). Furthermore, the AAV ITRs in the vector for the herein described 
nucleic acid delivery methods can also comprise AAV 1-8 or AAAV inverted terminal 

25 repeats. 

AAV Vector Generation 

It is understood that as discussed herein the use of the terms "homology" and 
"identity" mean the same thing as similarity. Thus, for example, if the use of the word 
homology is used to refer to two non-natural sequences, it is understood that this is not 
30 necessarily indicating an evolutionary relationship between these two sequences, but rather 
is looking at the similarity or relatedness between their nucleic acid sequences. Many of the 
methods for determining homology between two evolutionarily related molecules are 
routinely applied to any two or more nucleic acids or proteins for the purpose of measuring 
sequence similarity regardless of whether they are evolutionarily related. 
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5 In general, it is understood that one way to define any known variants and 

derivatives or those that might arise, of the disclosed nucleic acids and polypeptides herein, 
is through defining the variants and derivatives in terms of homology to specific known 
sequences. In general, variants of nucleic acids and polypeptides herein disclosed typically 
have at least, about 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 

1 0 89, 90, 91 , 92, 93, 94, 95, 96, 97, 98, or 99 percent homology to the stated sequence or the 
native sequence. Those of skill in the art readily understand how to determine the homology 
of two polypeptides or nucleic acids. For example, the homology can be calculated after 
aligning the two sequences so that the homology is at its highest level. 

Another way of calculating homology can be performed by published algorithms. 

1 5 Optimal alignment of sequences for comparison may be conducted by the local homology 
algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology 
alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by the search 
for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 
(1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, 

20 and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 
Science Dr., Madison, WI; the BLAST algorithm of Tatusova and Madden FEMS 
Microbiol. Lett. 174: 247-250 (1999) available from the National Center for Biotechnology 
Information (http://www.ncbi.nlm.nih.gov/blast/bl2seq/bl2.html) ), or by inspection. 

The same types of homology can be obtained for nucleic acids by for example the 

25 algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. 
Sci. USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 183:281-306, 1989 which 
are herein incorporated by reference for at least material related to nucleic acid alignment. It 
is understood that any of the methods typically can be used and that in certain instances the 
results of these various methods may differ, but the skilled artisan understands if identity is 

30 found with at least one of these methods, the sequences would be said to have the stated 
identity. 

For example, as used herein, a secyxepce recited as having a particular percent 
homology to another sequence refers to sequences that have the recited homology as 
calculated by any one or more of the calculation methods described above. For example, a 
35 first sequence has 80 percent homology, as defined herein, to a second sequence if the first 
sequence is calculated to have 80 percent homology to the second sequence using the Zuker 
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5 calculation method even if the first sequence does not have 80 percent homology to the 

second sequence as calculated by any of the other calculation methods. As another example, 
a first sequence has 80 percent homology, as defined herein, to a second sequence if the first 
sequence is calculated to have 80 percent homology to the second sequence using both the 
Zuker calculation method and the Pearson and Lipman calculation method even if the first 

1 0 sequence does not have 80 percent homology to the second sequence as calculated by the 
Smith and Waterman calculation method, the Needleman and Wunsch calculation method, 
the Jaeger calculation methods, or any of the other calculation methods. As yet another 
example, a first sequence has 80 percent homology, as defined herein, to a second sequence 
if the first sequence is calculated to have 80 percent homology to the second sequence using 

15 each of calculation methods (although, in practice, the different calculation methods will 
often result in different calculated homology percentages). 

Stringency of hybridization is controlled by both temperature and salt concentration 
of either or both of the hybridization and washing steps. Typically, the stringency of 
hybridization to achieve selective hybridization involves hybridization in high ionic strength 

20 solution (6X SSC or 6X SSPE) at a temperature that is about 12-25°C below the Tm (the 
melting temperature at which half of the molecules dissociate from their hybridization 
partners) followed by washing at a combination of temperature and salt concentration 
chosen so that the washing temperature is about 5°C to 20°C below the Tm. The 
temperature and salt conditions are readily determined empirically in preliminary 

25 experiments in which samples of reference DNA immobilized on filters are hybridized to a 
labeled nucleic acid of interest and then washed under conditions of different stringencies. 
Hybridization temperatures are typically higher for DNA-RNA and RNA-RNA 
hybridizations. The washing temperatures can be used as described above to achieve 
selective stringency, as is known in the art. (Sambrook et al., Molecular Cloning: A 

30 Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New 
York, 1989; Kunkel et al. Methods Enzymol. 1987:154:367, 1987). A preferable stringent 
hybridization condition for a DNA: DNA hybridization can be at about 68°C (in aqueous 
solution) in 6X SSC or 6X SSPE followed by washing at 68°C. Stringency of hybridization 
and washing, if desired, can be reduced accordingly as the degree of complementarity 

35 desired is decreased, and further, depending upon the G-C or A-T richness of any area 

wherein variability is searched for. Likewise, stringency of hybridization and washing, if 
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5 desired, can be increased accordingly as homology desired is increased, and further, 

depending upon the G-C or A-T richness of any area wherein high homology is desired, all 
as known in the art. 

In vivo administration to a human subject or an animal model can be by any of many 
standard means for administering viruses, depending upon the target organ, tissue or cell. 

10 Virus particles can be administered orally, parenterally (e.g., intravenously), by 

intramuscular injection, intrarectally, by direct tissue or organ injection, by intraperitoneal 
injection, topically, transdermally, via aerosol delivery, via the mucosa or the like. Viral 
nucleic acids (non-encapsidated) can also be administered, e.g., as a complex with cationic 
liposomes, or encapsulated in anionic liposomes. The present compositions can jnclude 

1 5 various amounts of the selected viral particle or non-encapsidated viral nucleic acid in 
combination with a pharmaceutically acceptable carrier and, in addition, if desired, may 
include other medicinal agents, pharmaceutical agents, carriers, adjuvants, diluents, etc. 
Parental administration, if used, is generally characterized by injection. Injectables can be 
prepared in conventional forms, either as liquid solutions or suspensions, solid forms 

20 suitable for solution or suspension in liquid prior to injection, or as emulsions. Dosages will 
depend upon the mode of administration, the disease or condition to be treated, and the 
individual subject's condition, but will be that dosage typical for and used in administration 
of other AAV vectors, such as AAV2 vectors. Often a single dose can be sufficient; 
however, the dose can be repeated if desirable. 

25 Administration of a recombinant AAV virion to the cell can be accomplished by any 

means, including simply contacting the particle, optionally contained in a desired liquid 
such as tissue culture medium, or a buffered saline solution, with the cells. The virion can 
be allowed to remain in contact with the cells for any desired length of time, and typically 
the virion is administered and allowed to remain indefinitely. For such in vitro methods, the 

30 virion can be administered to the cell by standard viral transduction methods, as known in 
the art and as exemplified herein. Titers of virus to administer can vary, particularly 
depending upon the cell type, but will be tvpical of that used for AAV transduction in 
general which is well known in the art. Additionally the titers used to transduce the 
particular cells in the present examples can be utilized. 

35 The cells that can be transduced by the present recombinant AAV virions can 

include any desired cell, such as the following cells and cells derived from the following 
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5 tissues, human as well as other mammalian tissues, such as primate, horse, sheep, goat, pig, 
dog, rat, and mouse and avian species: Adipocytes, Adenocyte, Adrenal cortex, Amnion, 
Aorta, Ascites, Astrocyte, Bladder, Bone, Bone marrow, Brain, Breast, Bronchus, Cardiac 
muscle, Cecum, Cervix, Chorion, Cochlear, Colon, Conjunctiva, Connective tissue, Cornea, 
Dermis, Duodenum, Embryonic stem cells, Endometrium, Endothelium, Endothelial cells, 

10 Epithelial tissue, Epithelial cells, Epidermis, Esophagus, Eye, Fascia, Fibroblasts, Foreskin, 
Gastric, Glial cells, Glioblast, Gonad, Hepatic cells, Histocyte, Hair cells in the inner ear, 
Ileum, Intestine, small Intestine, Jejunum, Keratinocytes, Kidney, Larynx, Leukocytes, 
Lipocyte, Liver, Lung, Lymph node, Lymphoblast, Lymphocytes, Macrophages, Mammary 
alveolar nodule, Mammary gland, Mastocyte, Maxilla, Melanocytes, Mesenchymal, 

1 5 Monocytes, Mouth, Myelin, Myoblasts Nervous tissue, Neuroblast, Neurons, Neuroglia, 
Osteoblasts, Osteogenic cells, Ovary, Palate, Pancreas, Papilloma, Peritoneum, Pituicytes, 
Pharynx, Placenta, Plasma cells, Pleura, Prostate, Rectum, Salivary gland, Skeletal muscle, 
Skin, Smooth muscle, Somatic, Spleen, Squamous, Stem cells, Stomach, Submandibular 
gland, Submaxillary gland, Synoviocytes, Testis, Thymus, Thyroid, Trabeculae, Trachea, 

20 Turbinate, Umbilical cord, Ureter, Uterus, and vestibular hair cells. 

Stringency of hybridization is controlled by both temperature and salt concentration 
of either or both of the hybridization and washing steps. Typically, the stringency of 
hybridization to achieve selective hybridization involves hybridization in high ionic strength 
solution (6X SSC or 6X SSPE) at a temperature that is about 12-25°C below the Tm (the 

25 melting temperature at which half of the molecules dissociate from their hybridization 
partners) followed by washing at a combination of temperature and salt concentration 
chosen so that the washing temperature is about 5°C to 20°C below the Tm. The 
temperature and salt conditions are readily determined empirically in preliminary 
experiments in which samples of reference DNA immobilized on filters are hybridized to a 

30 labeled nucleic acid of interest and then washed under conditions of different stringencies. 
Hybridization temperatures-are-typically-higher-for-DNA-RNA and RNA-RNA 
hybridizations. The " ' ° ° h ; " ^^aj^ntoTaMf BmT* as described above to achieve 
selective stringency, as is known in the art. (Sambrook et al., Molecular Cloning: A 
Laboratory Manual, 2nd EdTTCoicTSpring Harbor Laboratory, Cold Spring Harbor, New 

35 York, 1989; Kunkel et al. Methods Enzymol. 1987:154:367, 1987). A preferable stringent 
hybridization condition for a DNA:DNA hybridization can be at about 68°C (in aqueous 
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5 solution) in 6X SSC or 6X SSPE followed by washing at 68°C. Stringency of hybridization 
and washing, if desired, can be reduced accordingly as the degree of complementarity 
desired is decreased, and further, depending upon the G-C or A-T richness of any area 
wherein variability is searched for. Likewise, stringency of hybridization and washing, if 
desired, can be increased accordingly as homology desired is increased, and further, 
1 0 depending upon the G-C or A-T richness of any area wherein high homology is desired, all 
as known in the art. 

By the "suitability of an AAV vector for administration to a subject" is meant a 
determination of whether the AAV vector will elicit a neutralizing immune response upon 
administration to a particular subject. A vector that does not elicit a significant immune 

1 5 response is a potentially suitable vector, whereas a vector that elicits a significant, 

neutralizing immune response (e.g. at least 90%) is thus likely to be unsuitable for use in 
that subject. Significance of any detectable immune response is a standard parameter 
understood by the skilled artisan in the field. For example, one can incubate the subjects 
serum with the virus, then detemiine whether that virus retains its ability to transduce cells 

20 in culture. If such virus cannot transduce cells in culture, the vector likely has elicited a 
significant immune response. 

Alternatively, or additionally, one skilled in the art could determine whether or not 
AAV administration would be suitable for a particular cell type of a subject. For example, 
the artisan could culture muscle cells in vitro and transduce the cells with AAV in the 

25 presence or absence of the subject's serum. If there is a reduction in transduction efficiency, 
this could indicate the presence of a neutralizing antibody or other factors that may inhibit 
transduction. Normally, greater than 90% inhibition would have to be observed in order to 
rule out the use of AAV-5 as a vector. However, this limitation could be overcome by 
treating the subject with an immunosuppressant that could block the factors inhibiting 

30 transduction. 
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5 EXAMPLES 

The following examples are put forth so as to provide those of ordinary skill in the 
art with a complete disclosure and description of how the compounds, compositions, 
articles, devices and/or methods claimed herein are made and evaluated, and are intended to 
be purely exemplary of the invention and are not intended to limit the scope of what the 
10 inventors regard as their invention. Efforts have been made to ensure accuracy with respect 
to numbers (e.g., amounts, temperature, etc.), but some errors and deviations should be 
accounted for. Unless indicated otherwise, parts are parts by weight, temperature is in °C or 
is at ambient temperature, and pressure is at or near atmospheric. 

15 Example 1 

Previous research had demonstrated that Caco-2 and MDCK cells are model cell 
lines for the study of macromolecular transport via transcytosis. Furthermore these cell lines 
have been used to demonstrate transcytosis of both viruses and proteins. Therefore, to test if 
AAV can spread through tissue by transcytosis, 2xl0 8 DNA resistant particles of 

20 recombinant AAV2 (rAAV2) AAV4, AAV5, AAV6, BAAV suspended in 50ul of medium 
were placed in the upper (apical) side of the transwell polycarbonate filter over a monolayer 
of cells each of the following cells Caco-2, MDCKI, MDCKJI, Human primary airways 
epithelia cells (Airway), Human primary immortalized epithelial endometrial, Bovine brain 
primary endothelia cells (BBB), or HeLa. All cultures had TERs indicating the formation of 

25 tight junctions and polarized phenotype. After 3 hours of incubation the medium in the basal 
side of the transwell was collected and tested for the presence of transcytosed rAAV DNA. 
Viral DNA was extracted from 200ul of basal medium and quantified by qPCR. 

In these cell lines, transcytosis was observed with several AAV serotypes and appeared 
to be both serotype and tissue-specific (Fig. 1). Three hours after the addition of AAV to the 

30 apical surface of the cells, over 800,000 particles of AAV5 were present in the media on the 
basal lateral side of the trans-well insert of CaGo-2 cells, but not the MDCK, airway epithelia, 
endometrial, or BBB cells (Fig. !>. SesesstarijpSAAV particles were detected in the media on the 
basal lateral side of the MDCK, airways epithelia, endometrial, and BBB cells but not the Caco- 
2 cells. Interestingly, AAV4 was detected in the basal lateral media of all cell types. No virus 

35 was detected in the basal lateral media when AAV2 was added to the apical surface in either 
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5 cell type. AAV6 did not transcytose in any of cell types tested, and was not tested on airway 
epithelia or BBB. HeLa cells do not form barrier epithelia and were used as a control. 

Example 2 

Previous work has demonstrated that transcytosis is a temperature dependent process 

10 than can be inhibited at 4°C. Transcytosis can also be inhibited by the addition of agents that 
selectively fix the plasma membrane. Recently the addition of tannic acid, a mild fixative 
agent, to the basal lateral surface blocked the transcytosis of GPI-anchored proteins to the 
apical surface (Polishchuk R, Nat Cell Biol. 2004. 6(4):297-307). Therefore the ability of 
this agent to block the transcytosis of AAV was tested. Treatment of the basal lateral 

1 5 surface of either Caco-2 or MDCK cells prior to virus addition to the apical surface blocked 
the accumulation of AAV 5 or BAAV in the basal lateral media. Furthermore, quantification 
of the intracellular virus demonstrated inhibition of exocytosis by tannic acid treatment 
dramatically increase the amount of AAV DNA in the cell suggesting the viral particles 
detected in the basal lateral media are the result of an intracellular transport process and not 

20 a paracellular route. 

Treatment of the basal lateral surface of Human primary airways epithelial cell 
(HAE) with tannic acid blocked the transcytosis of BAAV or AAV4 vector containing a 
GFP expression cassette from the apical surface to the basal lateral (Fig. 2). Furthermore 
transduction dramatically increased when assayed at 24 hrs post inoculation. In contrast no 

25 change was observed in AAV2 transduction, which did not demonstrate any transcytosis 
activity and has limited binding activity on HAE. 

Example 3 

To confirm the DNA detected in the basal lateral media was indeed extracted from 
30 intact virus, the material was tested for DNase resistance after treatment with heat, ionic 
detergent or protease. The addition of DNase alone or in combination with the ionic 
detergent deoxycholine had no effect on the viral DNA present in the media suggesting it 
was not free DNA or complexed in lipid vesicles. However, heating to 95°C prior to 
treatment with DNAase completely degraded the viral DNA present in the media. This 
35 profile is identical to that of the input AAV particles and suggests the viral DNA is still 
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5 encapsulated. Titration of the DNase resistant virus in the basal lateral media on Cos cells 
gave a similar particle to infectivity ratio to the input AAV particles. 

While it would appear the AAV DNA detected in the basal lateral media is 
contained in intact particles, its presence on the basal lateral surface could be the result of 
lyses of the cells or disruption of the monolayer. Therefore the TER was carefully monitored 

10 throughout the course of these experiments and was not observed to decrease. To further 
confirm the integrity of the cell monolayer, mixing experiments were studied in which two 
viruses with different gene cassettes were added to the apical surface at the same time and 
three hours post addition the amount of each virus in the basal lateral media was quantified 
using QPCR specific for each cassette. Both BAAV and AAV5 were able to pass from the 

15 apical to the basal lateral surface ofMDCK or Caco cells respectively but the AAV2 did 

not. Therefore the presence of viral particles in the basal lateral media does not appear to be 
the result of a disruption in the cell monolayer. 

Taken together this data suggest that dependoviruses particles are capable of passing 
through barrier epithelia via transcytosis and the process is both serotype and cell type 

20 specific. 

Example 4 

To further characterize the transcytosis activity observed with AAV5 and BAAV, 
transcytosis was quantified as both a time and concentration dependent event. After the 

25 addition of particles to the apical surface, samples were removed from the basal lateral 
media at different time points and the amount of virus was quantified by QPCR of the 
extracted DNA. Viral genomes could be detected as soon as 30 minutes after addition and 
steadily increased with time By 24 hrs, over 1/3 of the input recombinant AAV5, 
BAAVvirus added to Caco or MDCK cells respectively had been transported to the basal 

30 lateral surface. In contrast, none of the input AAV2 or adenovirus was detected on the basal 
lateral side after 24 hrs. 

If transcytosis is an activity used bv AAV to spread through tissue, this finding 
would help explain the lack of transduction of barrier epithelia reported with some isolates 
of AAV. Primary human bronchial airway epithelia (HAE) are known to transport albumin 

35 from the apical to the basal lateral surface by receptor-mediated transcytosis in vivo. While 
the interaction of BAAV with primary HAE has not been investigated, AAV4, 5 are 
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5 reported to bind to HAE, however, for AAV4, this interaction does not result in 

transduction. Because of the interaction of AAV4 with O-link sialic acid, it was proposed, 
and has been demonstrated, that mucins, which contained large amounts of O-linked sialic 
acid and are expressed on the apical surface of HAE, can block AAV4 transduction. 
Alternatively the lack of transduction could be the result of transcytosis of the virus through 
10 the tissue. 

To test this hypothesis, AAV2, 4, 5, BAAV were added to the apical surface of 
confluent monolayer cultures of primary human bronchial airway and transcytosis to the 
basal lateral surface was measured by QPCR after 3 hrs. All cultures had high TERs and 
expressed ciliated structures on their apical surface. Highly differentiated HAE cultures in 
1 5 contrast to immature cultures are resistant to transduction by adenoviral vectors due to a 
lack of integrin expression that is necessary for adenovirus entry. 

Of the 4 AAVs tested for transcytosis, AAV4 and BAAV were detected in the basal 
lateral media. No transport of AAV2 or AAV5 was detected. As a control, adenovirus also 
was tested for transcytosis activity in the HAE cultures, but no transport was detected. 

20 

Example 5 

Epithelial cells that line the genitourinary tract form an important epithelial barrier 
layer and can transport proteins by transcytosis. AAV2, 4, 5 or BAAV were therefore tested 
to determine for the ability to penetrate this barrier epithelial layer by transcytosis. A well- 
25 characterized model of endometrial cells has been reported by Kyo et al. Following addition 
of the 4 AAVs to the apical surface, BAAV and AAV4 could be detected in the basal lateral 
media when assayed at 3hrs post inoculation (Fig. 1). 

Example 6 

30 Most AAVs were identified originally as contaminants of laboratory stocks of 

—adenovirus, thus our understanding of their natural biology, cell tropism, and knowledge the 
r^Ut^y coruDonents required for virus entry is limited. For AAV5, in addition to N-linked 
sialic acid, the platelet derived growth factor (PDGF) receptors were identified as protein 
receptors for AAVS (Di Pasquale et al., Nat Med. 2003 Oct;9(10):1306-12). This interaction 

35 was confirmed by modulation of PDGFR expression by transfection of expression plasmids, 
inhibitor treatment, or competition experiments with the extracellular domain of PDGFRa. 
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5 Likewise AAV5 transduction could be blocked with sialolactosamine conjugates kaludov et 
al 2001. 

Previous research had demonstrated that transcytosis is actin dependent and occurs 
by a caviolin mediated pathway. Furthermore transcytosis can be blocked by treatment with 
tannic acid. Therefore to better characterize the transcytosis pathway utilized by AAV 5 in 

1 0 Caco cells the cells were treated with a panel of agents known to block either transcytosis in 
other systems or AAV5 mediated transduction. It was noted that AAV5 transcytosis could 
be inhibited by filipin and nocozodol as well as treatment with tannic acid. 

Caco cells, which actively transcytosis AAV5, are not reported to express PDGFR 
and are not transduced by AAV5. In agreement, competition experiments with sPDGFRa 

15 had little effect on AAV5 transcytosis. Furthermore, competition experiments with 200 
ug/ml sialolactosamine or 200 ug/ml heparin did not inhibited AAV5 transcytosis. 

Both BSA and transferrin are reported to transcytosis through Caco cells via distinct 
receptor mediated pathways. However competition with either agent did not inhibit AAV5 
transcytosis suggesting the AAV5 could use a distinct pathway. 

20 In addition to confirming the intracellular nature of AAV5 transcytosis in Caco cells, 

the above experiments suggest that AAV5 transcytosis is occurring by a pathway 
independent of the one described for transduction. To confirm this Caco cells were stably 
transfected with PDGFRa and assayed for both transcytosis and transduction activity. Caco 
cells were not permissive for AAV5 transduction, however transduction dramatically 

25 increase following stable expression of PDGFRa. In contrast only a minor increase in 
transcytosis activity was detected in the Caco/PDGFRa cells. 

Throughout this application, various publications are referenced. The disclosures of 
these publications in their entireties are hereby incorporated by reference into this 
application in order to more fully describe the state of the art to which this invention 

30 pertains. 

It will be apparent-to-those skilled-in-the-art-that various modifications and variations 
can be made in the present m^S^k^ w rititeWtf^dr^ arfitl P from the scope or spirit of the 
invention. Other embodiments of the invention will be apparent to those skilled in the art 
from consideration of the specification and practice of the invention disclosed herein. It is 
35 intended that the specification and examples be considered as exemplary only, with a true 
scope and spirit of the invention being indicated by the following claims. 
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What is claimed is: 

1. A method of delivering a heterologous nucleic acid across an epithelial barrier 
comprising delivering to the epithelial barrier an AAV vector comprising the 
heterologous nucleic acid. 

2. The method of claim 1, wherein the epithelial cells are in the gut, lung, genitourinary 
tract, kidney, blood vessels or brain. 

3. The method of claim 1, wherein the epithelial cells can be selected from a group 
consisting of bronchial, alveolar, tracheal or upper airway epithelial cells; absorptive 
enterocytes; endometrial or urinary epithelial cells; renal collecting duct or proximal 
tubule epithelial cells; cerebral microvascular endothelial cells; or Choroidal Plexus 
epithelial cells . 

4. A method of transcytosing epithelial cells of a human subject comprising 
administering to the subject an AAV vector comprising a heterologous nucleic acid. 

5. The method of claim 4, wherein the epithelial cells are selected from a group 
consisting of bronchial, alveolar, tracheal or upper airway epithelial cells; absorptive 
enterocytes; endometrial or urinary epithelial cells; renal collecting duct or proximal 
tubule epithelial eels; cerebral microvascular endothelial cells; or Choroidal Plexus 
epithelial cells. 

6. A method of delivering a heterologous nucleic acid across human airway epithelial 
cells, comprising delivering to the cells a BAAV vector comprising the nucleic acid. 

7. A method of delivering a heterologous nucleic acid across human cerebral 
microvascular endothelial cells, comprising delivering to the cells a BAAV vector 
comprising the nucleic acid. 

8. A method of delivering a heterologous nucleic acid across human endometrial 
epithelial cells, comprising delivering to the cells a BAAV vector comprising the 
nucleic acid. 

9. A method of delivering a heterologous nucleic acid across human kidney epithelial 
cells, comprising delivering to the cells a BAAV vector comprising the nucleic acid. 

10. A method of delivering a heterologous nucleic acid across human enterocytes, 
comprising delivering to the cells a AAV5 vector comprising the nucleic acid. 
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11. A method of delivering a heterologous nucleic acid across human airway epithelial 
cells, comprising delivering to the cells a AAV4 vector comprising the nucleic acid. 

12. A method of delivering a heterologous nucleic acid across human cerebral 
microvascular endothelial cells, comprising delivering to the cells a AAV4 vector 
comprising the nucleic acid. 

13. A method of delivering a heterologous nucleic acid across human endometrial 
epithelial cells, comprising delivering to the cells a AAV4 vector comprising the 
nucleic acid. 

14. A method of delivering a heterologous nucleic acid across human kidney epithelial 
cells, comprising delivering to the cells a AAV4 vector comprising the nucleic acid. 

15. A method of delivering a heterologous nucleic acid across human enterocytes 
comprising delivering to the cells a AAV4 vector comprising the nucleic acid. 

16. A method of delivering a heterologous nucleic acid across an epithelial barrier of the 
lung, comprising delivering to the lung a BAAV vector comprising the nucleic acid. 

1 7. The method of claim 16, wherein the epithelial barrier comprises human bronchial, 
alveolar, tracheal or upper airway epithelial cells. 

1 8. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
brain, comprising delivering to the brain a BAAV vector comprising the nucleic 
acid. 

1 9. The method of claim 1 8, wherein the epithelial barrier comprises human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

20. A method of delivering a heterologous nucleic acid across the epithelial barrier of 
blood vessels into the muscle, comprising delivering to the blood stream a BAAV 
vector comprising the nucleic acid. 

21 . The method of claim_20, wherein_the_epithelial barrier comprises human vascular 
endothelial cells of the blood bnain Huuxier- 

22. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
genitourinary tract, comprising delivering to the genitourinary tract a BAAV vector 
comprising the nucleic acid genitourinary tract. 
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23. The method of claim 22, wherein the epithelial barrier comprises human endometrial 
or urinary epithelial cells. 

24. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
kidney, comprising delivering to the genitourinary tract a BAAV vector comprising 
the nucleic acid genitourinary tract. 

25. The method of claim 24, wherein the epithelial barrier comprises human renal 
collecting ducts or proximal tubules. 

26. A method of transcytosing lung epithelial cells of a subject comprising contacting 
the lung epithelial cells of the subject with a BAAV vector comprising a 
heterologous nucleic acid. 

27. The method of claim 26, wherein the epithelial cells are human bronchial, tracheal, 
or upper airway epithelial cells. 

28. A method of transcytosing CNS epithelial cells of a subject comprising contacting 
the CNS epithelial cells of the subject with a BAAV vector comprising a 
heterologous nucleic acid. 

29. The method of claim 28, wherein the epithelial cells are human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

30. A method of transcytosing vascular epithelial cells of a subject comprising 
contacting the vascular epithelial cells of the subject with a BAAV vector 
comprising a heterologous nucleic acid. 

31. The method of claim 30, wherein the epithelial cells are human vascular endothelial 
cells of the blood brain barrier. 

32. A method of transcytosing genitourinary tract epithelial cells of a subject comprising 
contacting the genitourinary tract epithelial cells of the subject with a BAAV vector 
comprising a heterologous nucleic acid. 

33. The method of claim 32, wherein the epithelial cells are human endometrial or 
urinary tract epithelial cells. 

34. A method of transcytosing genitourinary tract epithelial cells of a subject comprising 
contacting the kidney epithelial cells of the subject with a BAAV vector comprising 
a heterologous nucleic acid. 

70 

233334_17.DOC 



ATTORNEY DOCKET NO. 14014.0427U1 



35. The method of claim 34, wherein the epithelial cells are human renal collecting 
ducts or proximal tubules 

36. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
gut, comprising delivering to the gut an AAV5 vector comprising the nucleic acid. 

37. The method of claim 36, wherein the epithelial barrier comprises human absorptive 
enterocytes. 

38. A method of transcytosing gut epithelial cells of a subject comprising contacting the 
gut epithelial cells of the subject with an AAV 5 vector comprising a heterologous 
nucleic acid. 

39. The method of claim 38, wherein the epithelial cells are human absorptive 
enterocytes. 

40. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
gut, comprising delivering to the gut an AAV4 vector comprising the nucleic acid. 

41 . The method of claim 40, wherein the epithelial barrier comprises human absorptive 
enterocytes. 

42. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
lung, comprising delivering to the lung an AAV4 vector comprising the nucleic acid. 

43. The method of claim 42, wherein the epithelial barrier comprises human bronchial, 
tracheal, or upper airway epithelial cells. 

44. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
CNS, comprising delivering to the CNS an AAV4 vector comprising the nucleic 
acid. 

45. The method of claim 44, wherein the epithelial barrier comprises human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

46. A method of delivering a heterologous nucleic acid across the epithelial barrier of 
blood vessels into the muscle, comprising delivering to the blood stream an AAV4 
vector comprising the nucleic acid. 

47. The method of claim 46, wherein the epithelial barrier comprises human vascular 
endothelial cells of the blood brain barrier. 
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48. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
genitourinary tract, comprising delivering to the genitourinary tract an AAV4 vector 
comprising the nucleic acid. 

49. The method of claim 48, wherein the epithelial barrier comprises human endometrial 
or urinary epithelial cells. 

50. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
kidneys, comprising delivering to the kidneys an AAV4 vector comprising the 
nucleic acid. 

5 1 . The method of claim 50, wherein the epithelial barrier comprises human renal 
collecting ducts or proximal tubules. 

52. A method of transcytosing lung epithelial cells of a subject comprising contacting 
the lung epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. 

53. The method of claim 52, wherein the epithelial cells are human bronchial, tracheal, 
or upper airway epithelial cells. 

54. A method of transcytosing CNS epithelial cells of a subject comprising contacting 
the CNS epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. 

55. The method of claim 54, wherein the epithelial cells are human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

56. A method of transcytosing vascular epithelial cells of a subject comprising 
contacting the vascular epithelial cells of the subject with an AAV4 vector 
comprising a heterologous nucleic acid. 

57. The method of claim 56, wherein the epithelial cells are vascular endothelial cells of 
the blood brain barrier. 

-58.. A method of-transcy tosing genitourinary tract epithelial cells of a subject comprising 
contacting the genitourinary epithelial cells of the subject with an AAV4 vector 
comprising a heterologous nucleic acid. 
59. The method of claim 58, wherein the epithelial cells are human endometrial or 
urinary epithelial cells. 
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60. A method of transcytosing kidney epithelial cells of a subject comprising contacting 
the kidney epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid, 

61 . The method of claim 60, wherein the epithelial cells are human renal collecting 
ducts or proximal tubules 

62. A method of transcytosing gut epithelial cells of a subject comprising contacting the 
CNS epithelial cells of the subject with an AAV4 vector comprising a heterologous 
nucleic acid. 

63. The method of claim 62, wherein the epithelial cells are human absorptive 
enterocytes. 
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ABSTRACT OF THE DISCLOSURE 

The present invention provides methods of transcytosing barrier epithelial cells using 
adeno-associated virus-4 (AAV-4), adeno-associated virus- 5 (AAV5), bovine adeno- 
associated virus (BAAV), and vectors and particles derived therefrom. In addition, the 
present invention provides methods of delivering a nucleic acid across the barrier epithelia 
using the AAV4, AAV5, and BAAV vectors and particles. 
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Fig. 1) AAV Transcytosis in Different Endo and Epithelial cells 
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SEQ ID NO si 
AAV 4 genome 

ttggccactc cctctatgcg cgctcgctca ctcactcggc cctggagacc aaaggtctcc 60 

agactgccgg cctctggccg gcagggccga gtgagtgagc gagcgcgcat agagggagtg 12 0 

gccaactcca tcatctaggt ttgcccactg acgtcaatgt gacgtcctag ggttagggag 180 

gtccctgtat tagcagtcac gtgagtgtcg tatttcgcgg agcgtagcgg agcgcatacc 240 

aagctgccac gtcacagcca cgtggtccgt ttgcgacagt ttgcgacacc atgtggtcag 300 

gagggtatat aaccgcgagt gagccagcga ggagctccat tttgcccgcg aattttgaac 360 

gagcagcagc catgccgggg ttctacgaga tcgtgctgaa ggtgcccagc gacctggacg 420 

agcacctgcc cggcatttct gacfccttttg tgagctgggt ggccgagaag gaatgggagc 480 

tgccgccgga ttctgacatg gacttgaatc tgattgagca ggcacccctg accgtggccg 540 

aaaagctgca acgcgagttc ctggtcgagt ggcgccgcgt gagtaaggcc ccggaggccc 600 

tcttctttgt ccagttcgag aagggggaca gctacttcca cctgcacatc ctggtggaga 660 

ccgtgggcgt caaatccatg gtggtgggcc gctacgtgag ccagattaaa gagaagctgg 720 

tgacccgcat ctaccgcggg gtcgagccgc agcttccgaa ctggttcgcg gtgaccaaga 780 

cgcgtaatgg cgccggaggc gggaacaagg tggtggacga ctgctacatc cccaactacc 84 0 

tgctccccaa gacccagccc gagctccagt gggcgtggac taacatggac cagtatataa 900 

gcgcctgttt gaatctcgcg gagcgtaaac ggctggtggc gcagcatctg acgcacgtgt 960 

cgcagacgca ggagcagaac aaggaaaacc agaaccccaa ttctgacgcg ccggtcatca 102 0 

ggtcaaaaac ctccgccagg tacatggagc tggtcgggtg gctggtggac cgcgggatca 1080 

cgtcagaaaa gcaatggatc caggaggacc aggcgtccta catctccttc aacgccgcct 1140 

ccaactcgcg gtcacaaatc aaggccgcgc tggacaatgc ctccaaaatc atgagcctga 1200 

caaagacggc tccggactac ctggtgggcc agaacccgcc ggaggacatt tccagcaacc 1260 

gcatctaccg aatcctcgag atgaacgggt acgatccgca gtacgcggcc tccgtcttcc 1320 

tgggctgggc gcaaaagaag ttcgggaaga ggaacaccat ctggctcttt gggccggcca 1380 

cgacgggtaa aaccaacatc gcggaagcca tcgcccacgc cgtgcccttc tacggctgcg 1440 

tgaactggac caatgagaac tttccgttca acgattgcgt cgacaagatg gtgatctggt 150O 

gggaggaggg caagatgacg gccaaggtcg tagagagcgc caaggccatc ctgggcggaa 1560 

gcaaggtgcg cgtggaccaa aagtgcaagt catcggccca gatcgaccca actcccgtga 1620 

tcgtcacctc caacaccaac atgtgcgcgg tcatcgacgg aaactcgacc accttcgagc 1680 

accaacaacc actccaggac cggatgttca agttcgagct caccaagcgc ctggagcacg 1740 

actttggcaa ggtcaccaag caggaagtca aagacttttt ccggtgggcg tcagatcacg 1800 

tgaccgaggt gactcacgag ttttacgtca gaaagggtgg agctagaaag aggcccgccc 1860 

ccaatgacgc agatataagt gagcccaagc gggcctgtcc gtcagttgcg cagccatcga 1920 

cgtcagacgc ggaagctccg gtggactacg cggacaggta ccaaaacaaa tgttctcgtc 1980 

acgtgggtat gaatctgatg ctttttccct gccggcaatg cgagagaatg aatcagaatg 204 0 

tggacatttg cttcacgcac ggggtcatgg actgtgccga gtgcttcccc gtgtcagaat 2100 

ctcaacccgt gtctgtcgtc agaaagcgga cgtatcagaa actgtgtccg attcatcaca 2160 

tcatggggag ggcgcccgag gtggcctgct cggcctgcga actggccaat gtggacttgg 222 0 

atgactgtga catggaacaa taaatgactc aaaccagata tgactgacgg ttaccttcca 2280 

gattggctag aggacaacct ctctgaaggc gttcgagagt ggtgggcgct gcaacctgga 2340 

gcccctaaac ccaaggcaaa tcaacaacat caggacaacg ctcggggtct tgtgcttccg 2400 

ggttacaaat acctcggacc cggcaacgga ctcgacaagg gggaacccgt caacgcagcg 24 60 

gacgcggcag ccctcgagca cgacaaggcc tacgaccagc agctcaaggc cggtgacaac 252 0 

ccctacctca agtacaacca cgccgacgcg gagttccagc agcggcttca gggcgacaca 2580 

ccgtttgggg gcaacctcgg cagagcagtc ttccaggcca aaaagagggt tcttgaacct 2640 

cttggtctgg ttgagcaagc gggtgagacg gctcctggaa agaagagacc gttgattgaa 2700 

tccccccagc agcccgactc ctccacgggt atcggcaaaa aaggcaagca gccggctaaa 2760 

aagaagctcg ttttcgaaga cgaaactgga gcaggcgacg gaccccctga gggatcaact 2820 

tccggagcca tgtctgatga cagtgagatg cgtgcagcag ctggcggagc tgcagtcgag 2880 

ggsggacaag gtgccgatgg agtgggtaat gcctcgggtg attggcattg cgattccacc 2 94 0 

tggtctgagg gccacgtcac gaccaccagc accagaacct gggtcttgcc cacctacaac 3 000 

aaccacctnt acaagcgact cggagagagc ctgcagtcca acacctacaa cggattctcc 3 060 

accccctggg gatactttga cttcaaccgc ttccactgcc acttctcacc acgtgactgg 3120 

cagcgactca tcaacaacaa ctggggc at g c^garrccaaag ccatgcgggt caaaatcttc 318C 

aacatccagg tcaaggaggt cacgacgtcg aacggcgaga caacggtggc taataacctt 3240 

accagcacgg ttcagatctt tgcggactcg tcgtacgaac tgccgtacgt gatggatgcg 3300 

ggtcaagagg gcagcctgcc tccttttccc aacgacgtct ttatggtgcc ccagtacggc 3360 

tactgtggac tggtgaccgg caacacttcg cagcaacaga ctgacagaaa tgccttctac 3420 

tgcctggagt actttccttc gcagatgctg cggactggca acaactttga aattacgtac 3480 

agttttgaga aggtgccttt ccactcgatg tacgcgcaca gccagagcct ggaccggctg 3 54 0 

atgaaccctc tcatcgacca gtacctgtgg ggactgcaat cgaccaccac cggaaccacc 3600 

ctgaatgccg ggactgccac caccaacttt accaagctgc ggcctaccaa cttttccaac 3660 
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tttaaaaaga 
aatcaaaact 
agcactctgg 
cctgcggaca 
aacacggcca 
aacgccaccg 
ctgccgaccg 
agagacattt 
cacccctcac 
aagaacaccc 
ttcattactc 
gagcggtcca 
tctctgttgt 
cgctacctca 
gttgaacttt 
taagcagcgg 
tctggcaaac 
ggccctggag 
agcgagcgcg 



actggctgcc 
acaagatccc 
acggaagatg 
gcaagttcag 
ccgtacccgg 
atacggacat 
tggacagact 
actaccaggg 
cgctgattgg 
cggtacctgc 
agtacagcac 
aacgctggaa 
gggctcccga 
cccaccacct 
ggtctccgtg 
cctgcggcgc 
catgatgatg 
accaaaggtc 
catagaggga 



cgggccttca 
tgccaccggg 
gagtgccctg 
caacagccag 
gactctgatc 
gtggggcaac 
gacagccttg 
tcccatttgg 
tgggtttggg 
gaatcctgca 
tggccaggtg 
ccccgaggtc 
tgcggctggg 
gtaataacct 
tccttcttat 
ttgcgcttcg 
gagttggcca 
tccagactgc 
gtggccaa 



atcaagcagc 
tcagacagtc 
acccccggac 
ctcatctttg 
ttcacctctg 
ctacctggcg 
ggagccgtgc 
gccaagattc 
ctgaaacacc 
acgaccttca 
tcggtgcaga 
cagtttacct 
aaatacactg 
gttaatcaat 
cttatctcgt 
cggtttacaa 
ctccctctat 
cggcctctgg 



agggcttctc 
tcatcaaata 
ctccaatggc 
cggggcctaa 
aggaggagct 
gtgaccagag 
ctggaatggt 
ctcataccga 
cgcctcctca 
gctctactcc 
ttgactggga 
ccaactacgg 
agcctagggc 
aaaccggttt 
ttccatggct 
ctgccggtta 
gcgcgctcgc 
ccggcagggc 



aaagactgcc 
cgagacgcac 
cacggctgga 
acagaacggc 
ggcagccacc 
caacagcaac 
ctggcaaaac 
tggacacttt 
aatttttatc 
ggtaaactcc 
gatccagaag 
acagcaaaac 
tatcggtacc 
attcgtttca 
actgcgtaca 
atcagtaact 
tcactcactc 
cgagtgagtg 



3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4768 



SEQ ID NO: 2 

AAV 4 Rep protein (full length) 



Met 


Pro 


Gly 


Pne 


Tyr 


G1U 


x xe 


vai 


1 








5 








Glu 


His 


Leu 


Pro 


Gly 


lie 


Ser 


Asp 








20 










Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 






35 










4 0 


Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


val 


Ala 




50 










55 




Val 


Glu 


Trp 


Arg 


Arg 


val 


ser 


Lys 


65 










7 0 






Gin 


Phe 


Glu 


Lys 


Gly 


Asp 


Ser 


Tyr 










85 








Thr 


Val 


Gly 


Val 


Lys 


Ser 


Met 


Val 








100 










Lys 


Glu 


Lys 


Leu 


Val 


Thr 


Arg 


lie 






115 










120 


Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys 




130 










135 




Asn 


Lys 


Val 


Val 


Asp 


Asp 


Cys 


Tyr 


145 










150 . 






Thr 


Gin 


Pro 


Glu 


Leu 


Gin 


Trp 


Ala 










165 








ser 


Ala 


Cys 


Leu 


Asn 


Leu 


Ala 


Glu 






180 










Leu 


Thr 


His 


Val 


Ser 


Gin 


Thr 


Gin 






195 










200 


Pro 


Asn 


Ser 


Asp 


Ala 


Pro 


Val 


lie 




210 










215 




Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Val 


225 










230 






Gin 


Trp 


lie 


Gin 


Glu 


Asp 


Gin 


Ala 










245 








Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


lie 


Lys 








260 










lie 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 






275 










280 


Pro 


Pro 


Glu 


Asp 


lie 


Ser 


Ser 


Asn 




290 










295 




Asn 


Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 


305 










310 







Leu 


Lys 


vai 


Pro 


Ser 


Asp 


Leu 


Asp 




10 










J- 3 




Ser 


Pne 


va jl 


Ser 


Trp 


vol 


n. J. Cl 


Glu 


2 5 










■3 u 






C a n- 

ser 


Asp 




Asp 


Leu 


Asn 


Leu. 


He 










4 b 








pi,, 


Lys 


Leu 


f"2l r*l 


Arg 




Phe 


Leu 








60 










M. J- cL 


Pro 


m ii 

UlU 




Leu 


Phe 


Phe 


Val 






75 










80 


Phe 


His 


Leu 


His 


He 


Leu 


Val 


Glu 




90 










95 




Val 


Gly 


Arg 


Tyr 


Val 


Ser 


Gin 


He 


105 










110 






Tyr 


Arg 


Gly 


Val 


Glu 


Pro 


Gin 


Leu 










125 








Thr 


Arg 


Asn 


Gly 


Ala 


Gly 


Gly 


Gly 








140 










lie 


Pro 


Asn 


Tyr 


Leu 


Leu 


Pro 


Lys 






155 










160 


Trp 


Thr 


Asn 


Met 


Asp 


Gin 


Tyr 


He 




170 










175 




Arg 


Lys 


Arg 


Leu 


Val 


Ala 


Gin 


His 


185 










190 






Glu 


Gin 


Asn 


Lys 


Glu 


Asn 


Gin 


Asn 










205 








Arg 


Ser 


Lys 


Thr 


Ser 


Ala 


Arg 


Tyr 








220 










Asp 


Arg 


Gly 


He 


Thr 


Ser 


Glu 


Lys 






235 










240 


Ser 


Tyr 


He 


Ser 


Phe 


Asn 


Ala 


Ala 




250 










255 




Ala 


Ala 


Leu 


Asp 


Asn 


Ala 


Ser 


Lys 


265 










270 






Pro 


Asp 


Tyr 


Leu 


val 


Gly 


Gin 


Asn 










285 








Arg 


He 


Tyr 


Arg 


He 


Leu 


Glu 


Met 








300 










Ala 


Ser 


Val 


Phe 


Leu 


Gly 


Trp 


Ala 



315 320 
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UlU 




Lys 


Phe 


1 tin 


1 111 


u iy 


Lys 










Phe 


Tyr 


Gly 


Cys 






355 




Cys 


va j. 


Asp 


Lys 




*a *7 n 






Lys 


va 1 


VOX 


Glu 


ope 








veil 


Asp 


Oi.Il 


T .\rez 


Tip 


Val 


Thr 


Ser 








A "5 rt 
ft x U 


Thr 


Thr 


Phe 


Glu 






435 






Leu 


i nir 


Lys 




a c r\ 
4 b U 






C?1U 


va x 


Lys 


Asp 


A £Z C 
no J 








TVrr 
x m 


nio 


Glu 


Phe 




sn 


S P 


Ala 










Ala 


Gin 


Pro 


Ser 






515 




Arg 


Tyr 


bin 


Asn 










pne 


Pro 


Cys 


Arg 


54 5 








Till 


l nr 


n.1 b 


u±y 


Ser 


Gin 


Pro 


Val 








580 


Pro 


lie 


His 


His 






595 




Cys 


Glu 


Leu 


Ala 




610 







Gly Lys Arg Asn 
325 

Thr Asn lie Ala 

Val Asn Trp Thr 
360 

Met Val lie Trp 
375 

Ser Ala Lys Ala 
390 

Cys Lys Ser Ser 
405 

Asn Thr Asn Met 

His Gin Gin Pro 
440 

Arg Leu Glu His 
4 55 

Phe Phe Arg Trp 
470 

Tyr Val Arg Lys 
485 

Asp lie Ser Glu 

Thr Ser Asp Ala 
520 

Lys Cys Ser Arg 
535 

Gin Cys Glu Arg 
550 

Val Met Asp Cys 
565 

Ser Val Val Arg 

lie Met Gly Arg 
600 

Asn Val Asp Leu 
615 



Thr lie Trp Leu 
330 

Glu Ala lie Ala 
345 

Asn Glu Asn Phe 

Trp Glu Glu Gly 
380 

lie Leu Gly Gly 
395 

Ala Gin lie Asp 
410 

Cys Ala Val He 
425 

Leu Gin Asp Arg 

Asp Phe Gly Lys 
460 

Ala Ser Asp His 
475 

Gly Gly Ala Arg 
4 90 

Pro Lys Arg Ala 
505 

Glu Ala Pro Val 

His Val Gly Met 
540 

Met Asn Gin Asn 
555 

Ala Glu Cys Phe 
570 

Lys Arg Thr Tyr 
585 

Ala Pro Glu Val 

Asp Asp Cys Asp 
620 



Phe Gly Pro Ala 
335 

His Ala Val Pro 
350 

Pro Phe Asn Asp 
365 

Lys Met Thr Ala 

Ser Lys Val Arg 
400 

Pro Thr Pro Val 
415 

Asp Gly Asn Ser 
430 

Met Phe Lys Phe 
445 

Val Thr Lys Gin 

val Thr Glu Val 
480 

Lys Arg Pro Ala 
495 

Cys Pro Ser Val 
510 

Asp Tyr Ala Asp 
525 

Asn Leu Met Leu 

Val Asp He Cys 
560 

Pro Val Ser Glu 
575 

Gin Lys Leu Cys 
590 

Ala Cys Ser Ala 
605 

Met Glu Gin * 



SEQ ID NO: 3 

AAV4 Rep gene (full length) 



ATG 


CCG 


GGG 


TTC 


TAC 


GAG 


ATC 


GTG 


CTG 


AAG 


GTG 


CCC 


AGC 


GAC 


CTG 


GAC 


48 
Met 


Pro 


Gly 


Phe 


Tyr 


Glu 


lie 


val 


Leu 


Lys 


Val 


Pro 


Ser 


Asp 


Leu 


Asp 


1 






5 










10 










15 




GAG 


CAC 


CTG 


CCC 


GGC 


ATT 


TCT 


GAC 


TCT 


TTT 


GTG 


AGC 


TGG 


GTG 


GCC 


GAG 


96 
Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 


Asp 


Ser 


Phe- 


Val 


Ser 


Trp 


Val 


Ala 


Glu 








20 








25 










30 






AAG 


GAA 


TGG 


GAG 


CTG 












ATG 


GAC 


TTG 


AAT 


CTG 


ATT 


144 






























He 


Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 


Ser 


Asp 


Met 


Asp 


Leu 


Asn 


Leu 




35 










40 










45 








GAG 


CAG 


GCA 


CCC 


CTG 


ACC 


GTG 


GCC 


GAA 


AAG 


CTG 


CAA 


CGC 


GAG 


TTC 


CTG 


192 
Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 


Glu 


Lys 


Leu 


Gin 


Arg 


Glu 


Phe 


Leu 




50 










55 










60 










GTC 


GAG 


TGG 


CGC 


CGC 


GTG 


AGT 


AAG 


GCC 


CCG 


GAG 


GCC 


CTC 


TTC 


TTT 


GTC 
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24 0 










vai 


Glu 


Trp 


ta 

Arg 


Arg 


65 










v,AU 


TTC 


nan 






288 










Gin 


pne 




Lys 












0 b 


ACL 




C3L7C 




a a a 


336 










Thr 


vai 


Gly 


Val 


Lys 








10 0 




AAA 


GAG 


AAG 


C 1G 




3 84 










Lys 


GlU 


Lys 


Leu 


vai 






115 






CC*J 




1 VjVj 


TTC 




4 32 










Pro 


Asn 


Trp 


Til— ^ 

Pne 


Aia 




13 0 








AAC 


AAG 


GTG 


GTG 


GAC 


480 










Asn 


Lys 


val 


vai 


Asp 


145 










ACC 


CAG 


CCC 


GAG 


CTC 


528 










Thr* 


Gin 


Pro 


Glu 


Leu 










16 5 


AGC 


GCC 


TGT 


TTG 


AAI 


576 










Ser 


Ala 


Cys 


Leu 


Asn 








180 




CTG 


ACG 


CAC 


GTG 


TCG 


624 










Leu 


Thr 


His 


Val 


Ser 






195 






CCC 


AAT 


TCT 


GAC 


GCG 


672 










Pro 


Asn 


Ser 


Asp 


Ala 




210 








ATG 


GAG 


CTG 


GTC 


GGb 


720 










Met 


Glu 


Leu 


vai 


Ciiy 


225 










CAA 


TGG 


ATC 


CAG 


GAG 


768 










Gin 


Trp 


lie 


Gin 


Glu 










245 


TCC 


AAC 


TCG 


CGG 


TCA 


816 










Ser 


Asn 


Ser 


Arg 


Ser 








2 60 




ATC 


ATG 


AGC 


CTG 


TV /"»7V 

ALA 


864 










lie 


Met 


Ser 


Leu 


Thr 






27 5 






CCG 


CCG 


VjAvj 




ATT 


912 










Pro 


Pro 


Glu 


Asp 


lie 




290 








AAC 


GGG 


TAC 


GAT 


CCG 


960 










Asn 


Gly 


Tyr 


Asp 


Pro 


305 










CAA 


AAG 


AAG 


TTC 


GGG 
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Val 


Ser 


Lys 


Ala 


Pro 


70 










GAC 


AGC 


TAC 


TTC 


CAC 


Asp 


Ser 


Tyr 


Phe 


His 










90 


± V— L- 




VJlU 


VJlU 


GGC 


Ser 


Met 


Val 


val 


Gly 








105 






V-VJV* 


ATC 


TAC 


CGC 


Thr 


Arg 


lie 


Tyr 


Arg 






120 








nv»v, 


AAG 


ACG 


CGT 


val 


Thr 


Lys 


Thr 


Arg 




13 5 










TCZC 


TAC 


ATC 


CCC 


Asp 


Cys 


Tyr 


He 


Pro 


150 










CAG 


TGG 


GCG 


TGG 


ACT 


Gin 


Trp 


Ala 


Trp 


Thr 










170 


CTC 


GCG 


GAG 


CGT 


AAA 


Leu 


Ala 


Glu 


Arg 


Lys 



185 

CAG ACG CAG GAG CAG 

Gin Thr Gin Glu Gin 
200 

CCG GTC ATC AGG TCA 

Pro Val He Arg Ser 
215 

TGG CTG GTG GAC CGC 

Trp Leu Val Asp Arg 
230 

GAC CAG GCG TCC TAC 

Asp Gin Ala Ser Tyr 
250 

CAA ATC AAG GCC GCG 

Gin lie Lys Ala Ala 
265 

AAG ACG GCT CCG GAC 

Lys Thr Ala Pro Asp 
280 

TCC AGC AAC CGC ATC 

Ser Ser Asn Arg lie 
295 

CAG TAC GCG GCC TCC 

Gin Tyr Ala Ala Ser 
310 

AAG AGG AAC ACC ATC 
78 



GlU 


Ala 


Leu 


Phe 


Phe 


Val 


75 










80 


CTG 


CAC 


ATC 


CTG 


GTG 


GAG 


Leu 


His 


He 


Leu 


Val 


Glu 










95 




CGC 


TAC 


GTG 


AGC 


CAG 


ATT 


Arg 


Tyr 


Val 


Ser 


Gin 


He 








110 






GGG 


GTC 


GAG 


CCG 


CAG 


CTT 


Gly 


Val 


Glu 


Pro 


Gin 


Leu 






125 








AAT 


GGC 


GCC 


GGA 


GGC 


GGG 


Asn 


Gly 


Ala 


Gly 


Gly Gly 




140 










AAC 


TAC 


CTG 


CTC 


CCC 


AAG 


Asn 


Tyr 


Leu 


Leu 


Pro 


Lys 


155 










160 


AAC 


ATG 


GAC 


CAG 


TAT 


ATA 


Asn 


Met 


Asp 


Gin 


Tyr 


He 










175 




CGG 


CTG 


GTG 


GCG 


CAG 


CAT 


Arg 


Leu 


Val 


Ala 


Gin 


His 








190 






AAC 


AAG 


GAA 


AAC 


CAG 


AAC 


Asn 


Lys 


Glu 


Asn 


Gin 


Asn 



205 



AAA 


ACC 


TCC 


GCC 


AGG 


TAC 


Lys 


Thr 


Ser 


Ala 


Arg 


Tyr 




220 










GGG 


ATC 


ACG 


TCA 


GAA 


AAG 


Gly 


He 


Thr 


Ser 


Glu 


Lys 


235 










240 


ATC 


TCC 


TTC 


AAC 


GCC 


GCC 


He 


Ser 


Phe 


Asn 


Ala 


Ala 










255 




CTG 


GAC 


AAT 


GCC 


TCC 


AAA 


Leu 


Asp 


Asn 


Ala 


Ser 


Lys 








270 






TAC 


CTG 


GTG 


GGC 


CAG 


AAC 


Tyr 


Leu 


Val 


Gly 


Gin 


Asn 






285 








TAC 


CGA 


ATC 


CTC 


GAG 


ATG 


Tyr 


Arg 


He 


Leu 


Glu 


Met 




300 










GTC 


TTC 


CTG 


GGC 


TGG 


GCG 


Val 


Phe 


Leu 


Gly 


Trp 


Ala 


315 










320 


TGG 


CTC 


TTT 


GGG 


CCG 


GCC 
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1008 

Gin Lys Lys Phe Gly Lys Arg Asn Thr lie Trp Leu Phe Gly Pro Ala 

325 330 335 

ACG ACG GGT AAA ACC AAC ATC GCG GAA GCC ATC GCC CAC GCC GTG CCC 
1056 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala He Ala His Ala Val Pro 

340 345 350 

TTC TAC GGC TGC GTG AAC TGG ACC AAT GAG AAC TTT CCG TTC AAC GAT 
1104 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

355 360 365 

TGC GTC GAC AAG ATG GTG ATC TGG TGG GAG GAG GGC AAG ATG ACG GCC 
1152 

Cys Val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met Thr Ala 

370 375 380 

AAG GTC GTA GAG AGC GCC AAG GCC ATC CTG GGC GGA AGC AAG GTG CGC 
1200 

Lys Val Val Glu Ser Ala Lys Ala lie Leu Gly Gly Ser Lys Val Arg 
385 390 395 400 

GTG GAC CAA AAG TGC AAG TCA TCG GCC CAG ATC GAC CCA ACT CCC GTG 
1248 

Val Asp Gin Lys Cys Lys Ser Ser Ala Gin He Asp Pro Thr Pro Val 

4 05 410 4 15 

ATC GTC ACC TCC AAC ACC AAC ATG TGC GCG GTC ATC GAC GGA AAC TCG 
1296 

He Val Thr Ser Asn Thr Asn Met Cys Ala Val He Asp Gly Asn Ser 

420 425 430 

ACC ACC TTC GAG CAC CAA CAA CCA CTC CAG GAC CGG ATG TTC AAG TTC 
1344 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

435 440 445 

GAG CTC ACC AAG CGC CTG GAG CAC GAC TTT GGC AAG GTC ACC AAG CAG 
1392 

Glu Leu Thr Lys Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 

4 50 455 4 60 

GAA GTC AAA GAC TTT TTC CGG TGG GCG TCA GAT CAC GTG ACC GAG GTG 
1440 

Glu Val Lys Asp Phe Phe Arg Trp Ala Ser Asp His Val Thr Glu Val 
465 ~ 470 475 480 

ACT CAC GAG TTT TAC GTC AGA AAG GGT GGA GCT AGA AAG AGG CCC GCC 
1488 

Thr His Glu Phe Tyr Val Arg Lys Gly Gly Ala Arg Lys Arg Pro Ala 

485 490 495 

CCC AAT GAC GCA GAT ATA AGT GAG CCC AAG CGG GCC TGT CCG TCA GTT 
1536 

Pro Asn Asp Ala Asp He Ser Glu Pro Lys Arg Ala Cys Pro Ser Val 

500 505 510 

GCG CAG CCA TCG ACG TCA GAC GCG GAA GCT CCG GTG GAC TAC GCG GAC 
1584 

Ala Gin Pro Ser Thr Ser Asp Ala Glu Ala Pro Val Asp Tyr Ala Asp 

515 520 525 

AGG TAC CAA AAC AAA TGT TCT CGT CAC GTG GGT ATG AAT CTG ATG CTT 
1632 

Arg Tyr Gin Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu 

530 535 540 

TTT CCC TGC CGG CAA TGC aSfe.355S^W7 CAG AAT GTG GAC ATT TGC 

1680 

Phe Pro Cys Arg Gin Cys Glu Arg Met Asn Gin Asn Val Asp He Cys 
545 550 555 560 

TTC ACG CAC GGG GTC ATG GAC TGT GCC GAG TGC TTC CCC GTG TCA GAA 
1728 

Phe Thr His Gly Val Met Asp Cys Ala Glu Cys Phe Pro Val Ser Glu 

565 570 575 

TCT CAA CCC GTG TCT GTC GTC AGA AAG CGG ACG TAT CAG AAA CTG TGT 
1776 

79 
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Ser Gin 


Pro 


val 


Ser 


val 


val 


Arg 


Lys 


Arg 


Thr 


Tyr 


Gin 


Lys 


Leu 


Cys 




580 








585 










590 






CCG ATT 


CAT 


CAC 


ATC 


ATG 


GGG 


AGG 


GCG 


CCC 


GAG 


GTG 


GCC 


TGC 


TCG 


GCC 


1824 




























Ala 


Pro lie 


His 


His 


He 


Met 


Gly 


Arg 


Ala 


Pro 


Glu 


Val 


Ala 


Cys 


Ser 




595 










600 










605 








TGC GAA 


CTG 


GCC 


AAT 


GTG 


GAC 


TTG 


GAT 


GAC 


TGT 


GAC 


ATG 


GAA 


CAA 


TAA 


1872 
























Glu 


Gin 




Cys Glu 


Leu 


Ala 


Asn 


Val 


Asp 


Leu 


Asp 


Asp 


Cys 


Asp 


Met 


* 


610 










615 










620 











SEQ ID NO: 4 

AAV4 caps id protein VP1 





inr 


Asp 


f2l \y 
\j x y 


xyx 


Leu 'Pro 


Asp 


Trn 


Leu 


Glu 


Asp 


Asn 


Leu 


Ser 


Glu 


1 




5 










10 










15 






vai 


Arg 


Glu 


Trp 


Trp 




Leu 


Gin 


Pro 


Glv 


Ala 


Pro 


Lys 


Pro 


Lys 




2 0 










25 










30 




Gly 




A c ti 
nail 


Gin 


Gin 


His 


bin 


Asp 


Asn 


Ala 


Arq 


Gly 


Leu 


val 


Leu 


Pro 






35 










40 










45 






Val 


Tyr 


Lys 


Tyr 


bell 


vj j. y 


Pro 


Gly 


Asn 


Glv 


Leu 


Asp 


Lvs 


Gly Glu 


Pro 


5 0 








55 










60 








Gin 


Asn 


fild 


/A _L CI 


Asp 


Ala 


Ala 


Ala 


Leu 


Glu 


His 


Asp 


Lys 


Ala 


Tyr 


Asp 


65 








70 










75 






His 




80 




Leu 


Lys 


Al a 
nia 


Gl V 


Asp 


Asn 


Pro 


Tvr 


Leu 


Lvs 


Tvr 


Asn 


Ala 


Asp 








85 










90 










95 




/via 


VjJ. U. 


rile 


Gin 


Gin 


Arg 


Leu 


Gin 


Glv 


Asp 


Thr 


Ser 


Phe 


Gly 


Gly 


Asn 








100 








105 










110 






Leu 




Arg 


Al a 


Val 


Pne 


Gin 


Ala 


Lvs 


LVS 


Arq 


Val 


Leu 


Glu 


Pro 


Leu 




115 










120 










125 










Leu 


Vdl 




m n 

will 


Ala 


Gly 


Glu 


Thr 


Ala 


Pro 


Glv 


Lys 


Lys 


Arg 


Pro 


X j \J 










135 










140 






Gly 




Leu 


Tip 

lie 


uiu 


ser 


Pro 


Gin 


Gin 


Pro 


As "D 


Ser 


Ser 


Thr 


Gly 


He 


Lys 


145 










150 










155 










160 


Lys 


vj±y 


T ire 




Pro 


Ala 


Lys 


Lys 


Lys 


Leu 


Val 


Phe 


Glu 


Asp 


Glu 


Thr 




165 










170 










175 




tj±y 


J\±cL 




Asp 


wiy 


Pro 


Pro 


Glu. 


Glv 


Ser 


Thr 


Ser 


Gly Ala 


Met 


Ser 




180 










185 










190 






Asp 


Asp 


Ser 


pi ii 


Mot* 


Arg 


Ala 


Ala 


Ala 


Glv 


Glv 


Ala 


Ala 


val 


Glu 


Gly 


195 










200 










205 




His 






r* 1 n 
\j JLIl 


n 1 v 
o± y 


Al a 

/ A J. ci 


Mop 


uiy 


Val 

v a x 


Glv 


Asn 


Ala 


Ser 


Gly 


Asp 


Trp 


Cys 


210 








215 










220 










Asp 


Ser 


Thr 


Trp 


Ser 


Glu 


Gly 


His 


Val 


Thr 


Thr 


Thr 


Ser 


Thr 


Arg 


Thr 


225 








230 










235 










240 


Trp 


Val 


Leu 


Pro 


Thr 


Tyr 


Asn 


Asn 


His 


Leu 


Tyr 


Lys 


Arg 


Leu 


Gly 


Glu 








245 










250 










255 




Ser 


Leu 


Gin 


Ser 


Asn 


Thr 


Tyr 


Asn 


Gly 


Phe 


Ser 


Thr 


Pro 


Trp 


Gly 


Tyr 








260 










265 










270 




Gin 


Phe 


Asp 


Phe 


Asn 


Arg 


Phe 


His 


Cys 


His 


Phe 


Ser 


Pro 


Arg 


Asp 


Trp 




275 










280 










285 






Val 


Arg 


Leu 


He 


Asn 


Asn 


Asn 


Trp 


Gly 


Met 


Arg 


Pro 


Lys 


Ala 


Met 


Arg 


290 










295 










300 










Lys 


He 


Phe 


Asn 


He 


Gin 


Val 


Lys 


Glu 


Val 


Thr 


Thr 


Ser 


Asn 


Gly 


Glu 


305 










310 










315 










320 


Thr 


Thr 


Val 


Ala 


Asn 
325 


Asn 


Leu 


Thr 


Ser 


Thr 
330 


Val 


Gin 


He 


Phe 


Ala 

335 


Asp 


Ser 


Ser 


Tyr 


Glu 


Leu 


Pro 


Tyr 


Val 


Met 


Asp 


Ala 


Gly 


Gin 


Glu 


Gly 


Ser 






340 










345 










350 






Leu 


Pro 


Pro 
355 


Phe 


Pro 


Asn 


Asp 


val 

360 


Phe 


Met 


Val 


Pro 


Gin 
365 


Tyr 


Gly 


Tyr 


Cys 


Gly 


Leu 


Val 


Thr 


Gly 


Asn 


Thr 


Ser 


Gin 


Gin 


Gin 


Thr 


Asp 


Arg 


Asn 


370 










375 










380 






Thr 


Gly 


Ala 


Phe 


Tyr 


Cys 


Leu 


Glu 


Tyr 


Phe 


Pro 


Ser 


Gin 


Met 


Leu 


Arg 



80 

233334_I7.DOC 



ATTORNEY DOCKET NO. 14014.0427U1 



3 85 










390 










395 








His 


400 


As n 


Asn 


Phe 


Glu 


He 


Thr 


Tvr 


Ser 


Phe 


Glu 


Lys 


Val 


Pro 


Phe 


Ser 










405 








410 










415 




Met 


Tvr 


Ala 


His 


Ser 


Gin 


Ser 


Leu 


Asp 


Arg 


Leu 


Met 


Asn 


Pro 


Leu 


He 






4 2 0 










425 










430 






Asp 


Gin 


Tvr 


Leu 


Tro 

x -"-XT 


Glv 


Leu 


Gin 


Ser 


Thr 


Thr 


Thr 


Gly 


Thr 


Thr 


Leu 




43 5 










440 










445 




Thr 




Asn 


Ala 


Glv 


Thr 


Ala 


Thr 


Thr 


Asn 


Phe 


Thr 


Lys 


Leu 


Arg 


Pro 


Asn 




4 50 








455 










460 










Phe 


Ser 


Asn 


Phe 


Lvs 


Lvs 


Asn 


Tro 


Leu 


Pro 


Gly 


Pro 


Ser 


He 


Lys 


Gin 


4 6 5 








47 0 










475 










480 


Gin 


Glv 
U1 / 


Phe 


Ser 


Lys 


Thr 


Ala 


Asn 


Gin 


Asn 


Tyr 


Lys 


He 


Pro 


Ala 


Thr 








*4 O J 










490 










495 






Ser 


Asp 


Ser 


Leu 


He 


LVS 


Tvr 


Glu 


Thr 


His 


Ser 


Thr 


Leu 


Asp 


Gly 






500 










505 










510 






Arg 


Trp 


Ser 


Ala 


Leu 


Thr 


Pro 


Gly 


Pro 


Pro 


wet 


Ala 


Thr 


Ala 


Gly 


Pro 


515 










520 










525 








Ala 


Asp 


Ser 


Lys 


Phe 


Ser 


Asn 


Ser 


Gin 


Leu 


lie 


Phe 


Ala 


Gly 


Pro 


Lys 




53 0 








53 5 










540 










Gin 


As n 


Glv 


Asn 


Thr 


Ala 


Thr 


Val 


Pro 


Gly 


Thr 


Leu 


He 


Phe 


Thr 


Ser 


54 5 








550 










555 










560 


Glu 


Glu 


Glu 


Leu 


Ala 
5 65 


Ala 


Thr 


Asn 


Ala 


Thr 
570 


Asp 


Thr 


Asp 


Met 


Trp 
575 


Gly 


As n 


Leu 


Pro 


Glv 


Glv 


Asp 


Gin 


Ser 


Asn 


Ser 


Asn 


Leu 


Pro 


Thr 


Val 


Asp 








5 8 0 








585 










590 






Arg 


Leu 


Thr 


Ala 


Leu 


Gly 


Ala 


val 


Pro 


Gly 


Met 


val 


Trp 


Gin 


Asn 


Arg 




C Q C 










600 










605 








Asp 


He 


Tyr 


Tyr 


Gin 


Gly 


Pro 


He 


Trp 


Ala 


Lys 


He 


Pro 


His 


Thr 


Asp 


610 










615 










620 










Gly 


His 


Phe 


His 


Pro 


Ser 


Pro 


Leu 


He 


Gly 


Gly 


Phe 


Gly 


Leu 


Lys 


His 


£ o ^ 










630 










635 










640 


Pro 


Pro 


Pro 


Gin 


He 
645 


Phe 


He 


Lys 


Asn 


Thr 
650 


Pro 


Val 


Pro 


Ala 


Asn 
655 


Pro 


>\ J. ct 


Thr 


Thr 


Phe 
660 


Ser 


Ser 


Thr 


Pro 


Val 
665 


Asn 


Ser 


Phe 


He 


Thr 
670 


Gin 


Tyr 


Ss r 


Thr 


Gly 


Gin 


Val 


Ser 


Val 


Gin 
680 


He 


Asp 


Trp 


Glu 


He 
685 


Gin 


. Lys 


Glu 


Arg 


Ser 


Lys 


Arg 


Trp 


Asn 


Pro 


Glu 


val 


Gin 


Phe 


Thr 


Ser 


Asn 


Tyr 


Gly 


690 










695 










700 








Thr 


Gin 


Gin 


Asn 


Ser 


Leu 


Leu 


Trp 


Ala 


Pro 


Asp 


Ala 


Ala 


Gly 


Lys 


Tyr 


705 










710 










715 










720 


Glu 


Pro 


Arg 


Ala 


He 


Gly 


Thr 


Arg 


Tyr 


Leu 


Thr 


His 


His 


Leu 












725 










730 















SEQ ID NO : 5 

AAV4 capsid protein VP1 gene 



atgactgacg 
tggtgggcgc 
gctcggggtc 
gg99aacccg 
cagctcaagg 
cagcggcttc 
aaaaagaggg 
aagaagagac 
aaaggcaagc 
ggaccccctg 
gctggcggag 
gattggcatt 
tgggtcttgc 
aacacctaca 
cacttctcac 
gccatgcggg 
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gttaccttcc 
tgcaacctgg 
ttgtgcttcc 
tcaacgcagc 
ccggtgacaa 
agggcgacac 
ttcttgaacc 
cgttgattga 
agccggctaa 
agggatcaac 
ctgcagtcga 
gcgattccac 
ccacctacaa 
acggattctc 
cacgtgactg 
tcaaaatctt 



agattggcta 
agcccctaaa 
gggttacaaa 
ggacgcggca 
cccctacctc 
atcgtttggg 
tcttggtctg 
atccccccag 
aaagaagctc 
ttccggagcc 
gggsggacaa 
ctggtctgag 
caaccacctn 
caccccctgg 
gcagcgactc 
caacatccag 



gaggacaacc 
cccaaggcaa 
tacctcggac 
gccctcgagc 
aagtacaacc 
ggcaacctcg 
gttgagcaag 
cagcccgact 
gttttcgaag 
atgtctgatg 
ggtgccgatg 
ggccacgtca 
tacaagcgac 
ggatactttg 
atcaacaaca 
gtcaaggagg 

81 



tctctgaagg 
atcaacaaca 
ccggcaacgg 
acgacaaggc 
acgccgacgc 
gcagagcagt 
cgggtgagac 
cctccacggg 
acgaaactgg 
acagtgagat 
gagtgggtaa 
cgaccaccag 
tcggagagag 
acttcaaccg 
actggggcat 
tcacgacgtc 



cgttcgagag 
tcaggacaac 
actcgacaag 
ctacgaccag 
ggagttccag 
cttccaggcc 
ggctcctgga 
tatcggcaaa 
agcaggcgac 
gcgtgcagca 
tgcctcgggt 
caccagaacc 
cctgcagtcc 
cttccactgc 
gcgacccaaa 
gaacggcgag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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acaacggtgg 
ctgccgtacg 
tttatggtgc 
actgacagaa 
aacaactttg 
agccagagcc 
tcgaccacca 
cggcctacca 
cagggcttct 
ctcatcaaat 
cctccaatgg 
gcggggccta 
gaggaggagc 
ggtgaccaga 
cctggaatgg 
cctcataccg 
ccgcctcctc 
agctctactc 
attgactggg 
tccaactacg 
gagcctaggg 



ctaataacct 
tgatggatgc 
cccagtacgg 
atgccttcta 
aaattacgta 
tggaccggct 
ccggaaccac 
acttttccaa 
caaagactgc 
acgagacgca 
ccacggctgg 
aacagaacgg 
tggcagccac 
gcaacagcaa 
tctggcaaaa 
atggacactt 
aaatttttat 
cggtaaactc 
agatccagaa 
gacagcaaaa 
ctatcggtac 



taccagcacg 
gggtcaagag 
ctactgtgga 
ctgcctggag 
cagttttgag 
gatgaaccct 
cctgaatgcc 
ctttaaaaag 
caatcaaaac 
cagcactctg 
acctgcggac 
caacacggcc 
caacgccacc 
cctgccgacc 
cagagacatt 
tcacccctca 
caagaacacc 
cttcattact 
ggagcggtcc 
ctctctgttg 
ccgctacctc 



gttcagatct 
ggcagcctgc 
ctggtgaccg 
tactttcctt 
aaggtgcctt 
ctcatcgacc 
gggactgcca 
aactggctgc 
tacaagatcc 
gacggaagat 
agcaagttca 
accgtacccg 
gatacggaca 
gtggacagac 
tactaccagg 
ccgctgattg 
ccggtacctg 
cagtacagca 
aaacgctgga 
tgggctcccg 
acccaccacc 



ttgcggactc 
ctccttttcc 
gcaacacttc 
cgcagatgct 
tccactcgat 
agtacctgtg 
ccaccaactt 
ccgggccttc 
ctgccaccgg 
ggagtgccct 
gcaacagcca 
ggactctgat 
tgtggggcaa 
tgacagcctt 
gtcccatttg 
gtgggtttgg 
cgaatcctgc 
ctggccaggt 
accccgaggt 
atgcggctgg 
tgtaataa 



gtcgtacgaa 
caacgacgtc 
gcagcaacag 
gcggactggc 
gtacgcgcac 
gggactgcaa 
taccaagctg 
aatcaagcag 
gtcagacagt 
gacccccgga 
gctcatcttt 
cttcacctct 
cctacctggc 
gggagccgtg 
ggccaagatt 
gctgaaacac 
aacgaccttc 
gtcggtgcag 
ccagtttacc 
gaaatacact 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2208 



SEQ ID NO: 6 

AAV4 ITR "flip" orientation 

ttggccactc cctctatgcg cgctcgctca ctcactcggc cctggagacc aaaggtctcc 
agactgccgg cctctggccg gcagggccga gtgagtgagc gagcgcgcat agagggagtg 
gccaa 

SEQ ID NO: 7 

AAV4 p5 promoter 

ctccatcatc taggtttgcc cactgacgtc aatgtgacgt cctagggtta gggaggtccc 
tgtattagca gtcacgtgag tgtcgtattt cgcggagcgt agcggagcgc ataccaagct 
gccacgtcac agccacgtgg tccgtttgcg acagtttgcg acaccatgtg gtcaggaggg 
tatataaccg cgagtgagcc agcgaggagc tccattttgc ccgcgaattt tgaacgagca 
gcagc 



60 
120 
125 



60 
120 
180 
240 
245 



SEQ ID NO: 8 



AAV4 


Rep 


protein 


40 








Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


val 


1 








5 








Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 








20 










Ser 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys 






35 










40 


He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala 




50 










55 




Pro 


Pro 


Glu 


Asp 


He 


Ser 


Ser 


Asn 


65 










70 






Asn 


Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 










85 








Gin 


Lys 


Lys 


Phe 


Gly 


Ijys 


Arg 


Asn 








100 










Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 






115 










120 


Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 




130 










135 




Cys 


Val 


Asp 


Lys 


Met 


Val 


He 


Trp 


145 










150 






Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 
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Asp 


Arg 


Gly 


He 


Thr 


Ser 


Glu 


Lys 




10 










15 




Ser 


Tyr 


He 


Ser 


Phe 


Asn 


Ala 


Ala 


25 








30 






Ala 


Ala 


Leu 


Asp 


Asn 


Ala 


Ser 


Lys 










45 








Pro 


Asp 


Tyr 


Leu 


Val 


Gly 


Gin 


Asn 








60 










Arg 


He 


Tyr 


Arg 


He 


Leu 


Glu 


Met 




75 










80 


Ala 


Ser 


Val 


Phe 


Leu 


Gly 


Trp 


Ala 




90 










95 




Tnr 


He 


Trp 


Leu 


Phe 


Gly 


Pro 


Ala 


105 










110 






Glu 


Ala 


He 


Ala 


His 


Ala 


Val 


Pro 










125 








Asn 


Glu 


Asn 


Phe 


Pro 


Phe 


Asn 


Asp 








140 










Trp 


Glu 


Glu 


Gly 


Lys 


Met 


Thr 


Ala 






155 










160 


lie 


Leu 


Gly 


Gly 


Ser 


Lys 


Val 


Ar 9 



82 
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Val 


Asp 


Gin 


Lys 








180 


lie 


val 


Thr 


Ser 






195 




1 111. 


i hi. 


Phe 


Glu 




210 






olU 


Le u 


1 111 


u y o 


O 0 ^ 
Ct c> O. 










val 


T ,vc 
uy t> 




Thr 


His 


Glu 


Phe 








2 6 0 


Pro 


Asn 


Asp 


Ala 






275 




Ala 


Gin 


Pro 


Ser 




290 






Arg 


Leu 


Ala 


Arg 


305 








SEQ ID NO: 9 




AAV 4 


Rep 


protein 


Met 


Glu 


Leu 


Val 


1 








Gin 


Trp 


lie 


Gin 








20 


Ser 


Asn 


Ser 


Arg 






35 




He 


Met 


Ser 


Leu 




50 






Pro 


Pro 


Glu 


Asp 


65 








Asn 


Gly 


Tyr 


Asp 


Gin 


Lys 


Lys 


Phe 








100 


Thr 


Thr 


Gly 


Lys 






115 




Phe 


Tyr 


Gly 


Cys 




130 






Cys 


Val 


Asp 


Lys 


145 








Lys 


Val 


Val 


Glu 


Val 


Asp 


Gin 


Lys 








180 


He 


Val 


Thr 


Ser 






195 




Thr 


Thr 


Phe 


Glu 




210 






Glu 


Leu 


Thr 


Lys 


225 








Glu 


Val 


Lys 


Asp 


Thr 


His 


Glu 


Phe 








260 


Pro 


Asn 


Asp 


Ala 






275 




Ala 


Gin 


Pro 


Ser 




290 






Arg 


Tyr 


Gin 


Asn 


305 
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-1 O -3 










U jr O 


Ser 


Ser 


Asn 


Thr 


Asn 


Met 








200 


His 


Gin 


Gin 


Pro 






215 






Leu 


Glu 


Hi s 




j u 








Dh <=» 

flic 


Arg 


Trp 


OAR 








xyxr 






Lys 


Asp 


lie 


Ser 


Glu 








280 


Thr 


Ser 


Asp 


Ala 






295 




Gly 


Gin 


Pro 


Leu 




310 







52 








Gly 


Trp 


Leu 


Val 


5 








Glu 


Asp 


Gin 


Ala 


Ser 


Gin 


lie 


Lys 








40 


Thr 


Lys 


Thr 


TV T — 

Ala 






55 




lie 


Ser 


Ser 


Asn 




7 0 






Pro 


Gin 


Tyr 


Ala 


85 








Gly 


Lys 


Arg 


Asn 


Thr 


Asn 


He 


Ala 








120 


Val 


Asn 


Trp 


Thr 






135 




Met 


Val 


He 


Trp 




150 






Ser 


Ala 


Lys 


Ala 


165 








Cys 


Lys 


Ser 


Ser 


Asn 


Thr 


Asn 


Met 








200 


His 


Gin 


Gin 


Pro 






215 




Arg 


Leu 


Glu 


His 




23 0 






Phe 


Phe 


Arg 


Trp 


245 








Tyr 


Val 


Arg 


Lys 


Asp 


lie 


Ser 


Glu 








280 


Thr 


Ser 


Asp 


Ala 






295 




Lys 


Cys 


Ser 


Arg 




310 









inn 
X / \J 






Ala 


Gin 


He 


Asd 


JL O J 










Ala 


Val 


lie 


Leu 


Gin 


Asp 


Arcr 








220 


Asp 


Phe 


Gly 


Lys 






235 




Aid 


Ser 


Asp 


n -L & 














i-V J. ct 


TV "f-ci 


265 








Pro 


Lys 


Arg 


Ala 


Glu 


Ala 


Pro 


val 








300 


Xaa 









Asp Arg Gly lie 
10 

Ser Tyr lie Ser 
25 

Ala Ala Leu Asp 

Pro Asp Tyr Leu 
60 

Arg lie Tyr Arg 
75 

Ala Ser Val Phe 
90 

Thr lie Trp Leu 
105 

Glu Ala He Ala 

Asn Glu Asn Phe 
140 

Trp Glu Glu Gly 
155 

He Leu Gly Gly 
170 

Ala Gin He Asp 
185 

Cys Ala Val He 

Leu Gin Asp Arg 
220 

Asp Phe Gly Lys 
235 

Ala Ser Asp His 

Gly Gly Ala Arg 
265 

Pro Lys Arg Ala 

Glu Ala Pro Val 
300 

His Val Gly Met 
315 

83 







175 




Pro 


Thr 


Pro 


Val 




190 






Asp 


Gly 


Asn 


Ser 


205 








Met 


Phe 


Lys 


Phe 


Val 


Thr 


Lys 


Gin 








240 


Val 


Thr 


Glu 


val 






255 




Lys 


Arg 


Pro 


Ala 




270 






Cys 


Pro 


Ser 


Val 


285 








Asp 


Tyr 


Ala 


Asp 



Thr 


Ser 


1*21 n 
bill 


Lys 






19 




Phe 


Asn 


Ala 


Ala 




fi 
J u 






Asn 


Ala 


Ser 


Lys 










Val 


Gly 


Gin 


Asn 


lie 


Leu 














Leu 


Gly 


Trp 


Ala 






95 




Phe 


Gly 


Pro 


Ala 




110 






His 


Ala 


Val 


Pro 


125 








Pro 


Phe 


Asn 


Asp 


Lys 


Met 


Thr 


Ala 








160 


Ser 


Lys 


Val 


Arg 






175 




Pro 


Thr 


Pro 


Val 




190 






Asp 


Gly 


Asn 


Ser 


205 








Met 


Phe 


Lys 


Phe 


Val 


Thr 


Lys 


Gin 








240 


Val 


Thr 


Glu 


Val 






255 




Lys 


Arg 


Pro 


Ala 




270 






Cys 


Pro 


Ser 


val 


285 








Asp 


Tyr 


Ala 


Asp 


Asn 


Leu 


Met 


Leu 








320 
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rne 


Pro 


Cys 




rile 


Thr 
X 111. 




oxy 








340 


Ssr 






Val 






355 




Pro 


He 


His 


Hi s 




370 






Cys 


Glu 


Leu 


Ala 


385 








SEQ XD NO: 10 




AAV 4 


Rep 


protein 


Met 


Pro 


Gly 


Phe 


1 








Glu 


His 


Leu 


Pro 








20 


Lys 


Glu 


Trp 


Glu 






35 




Glu 


Gin 


Ala 


Pro 




5 0 






Val 


Glu 


Trp 


Arg 


65 








Gin 


Phe 


Glu 


Lys 


Thr 


Val 


Gly 


Val 








100 


Lys 


Glu 


Lys 


Leu 






115 




Pro 


Asn 


Trp 


Phe 




130 






Asn 


Lys 


Val 


Val 


14 5 








Thr 


Gin 


Pro 


Glu 


Ser 


Ala 


Cvs 


Leu 








180 


Leu 


Thr 


His 


Val 






195 




Pro 


Asn 


Ser 


Asp 




210 






Met 


Glu 


Leu 


Val 


Z Z O 








Gin 


Trp 


He 


Gin 


Ser 


Asn 


Ser 


Arg 








260 


He 


Met 


Ser 


Leu 






275 




Pro 


Pro 


Glu 


Asp 




290 






Asn 


Gly 


Tyr 


Asp 


305 








Gin 


Lys 


Lys 


Phe 


Thr 


Thr 


Gly 


Lys 








340 


Phe 


Tyr 


Gly 


Cys 






355 




Cys 


Val 


Asp 


Lys 




370 






Lys 


Val 


Val 


Glu 



Gin 


Cvs 


Glu 


Ara 


"IOC 








Val 


Met 






Ser 


Val 


Val 


Arg 








360 


He 


Met 


Gly 


Arg 






375 




Asn 


Val 


Asp 


Leu 




390 






© o 








Tyr 


ol U 


x xcr 


V Cl _L 


tr 










He 


Ser 


Asp 


Leu 


Pro 


Pro 


Asp 








4 0 


Leu 


Thr 


Val 


Ala 






55 




Ml y 


Val 


Ser 


Lys 












nop 


Ser 


a y x 


ft c: 

O D 








Lys 


Ser 


Met 


Val 


Va 1 

V CX J. 


Thr 




He 








1 TO 
X £. KJ 


Ala 


Val 


Thr 


Lys 






135 




Asp 


Asp 


L / S 


1 y i 




150 






Leu 


Gin 


Trp 


Ala 


165 








Asn 


Leu 


Ala 


Glu 


Ser 


Gin 


Thr 


Gin 



200 

Ala Pro Val He 
215 

Gly Trp Leu Val 
230 

Glu Asp Gin Ala 
245 

Ser Gin He Lys 

Thr Lys Thr Ala 
280 

He Ser Ser Asn 
295 

Pro Gin Tyr Ala 
310 

Gly Lys Arg Asn 
325 

Thr Asn He Ala 

Val Asn Trp Thr 
360 

Met Val lie Trp 
375 

Ser Ala Lys Ala 



Met Asn Gin Asn 
330 

Ala Glu Cys Phe 
345 

Lys Arg Thr Tyr 

Ala Pro Glu Val 
380 

Asp Asp Cys Asp 
395 



Leu Lys Val Pro 
10 

Ser Phe val Ser 
25 

Ser Asp Met Asp 

Glu Lys Leu Gin 
60 

Ala Pro Glu Ala 
75 

Phe His Leu His 
90 

Val Gly Arg Tyr 
105 

Tyr Arg Gly Val 

Thr Arg Asn Gly 
140 

He Pro Asn Tyr 
155 

Trp Thr Asn Met 
170 

Arg Lys Arg Leu 
185 

Glu Gin Asn Lys 

Arg Ser Lys Thr 
220 

Asp Arg Gly He 
235 

Ser Tyr He Ser 
250 

Ala Ala Leu Asp 
265 

Pro Asp Tyr Leu 

Arg He Tyr Arg 
300 

Ala Ser Val Phe 
315 

Thr He Trp Leu 
330 

Glu Ala He Ala 
345 

Asn Glu Asn Phe 

Trp Glu Glu Gly 
3 80 

He Leu Gly Gly 
84 



Val Asp He Cys 
335 

Pro Val Ser Glu 
350 

Gin Lys Leu Cys 
365 

Ala Cys Ser Ala 
Met Glu Gin 



Ser Asp Leu Asp 
15 

Trp Val Ala Glu 
3 0 

Leu Asn Leu He 
45 

Arg Glu Phe Leu 

Leu Phe Phe Val 
80 

He Leu val Glu 
95 

Val Ser Gin He 
110 

Glu Pro Gin Leu 
125 

Ala Gly Gly Gly 

Leu Leu Pro Lys 
160 

Asp Gin Tyr He 
175 

Val Ala Gin His 
190 

Glu Asn Gin Asn 
205 

Ser Ala Arg Tyr 

Thr Ser Glu Lys 
240 

Phe Asn Ala Ala 
255 

Asn Ala Ser Lys 
270 

Val Gly Gin Asn 
285 

He Leu Glu Met 

Leu Gly Trp Ala 
320 

Phe Gly Pro Ala 

335 

His Ala Val Pro 
350 

Pro Phe Asn Asp 
365 

Lys Met Thr Ala 
Ser Lys Val Arg 
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385 390 395 400 



val 


Asp 


Gin 


Lvs 


Cvs 


LVS 


Ser 


Ser 


Ala 


Gin 


He 


Asp 


Pro 


Thr 


Pro 


Val 








405 










410 










415 




lie 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 


Cys 


Ala 


val 


He 


Asp 


Gly 


Asn 


Ser 








420 










425 










430 






Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 


Leu 


Gin 


Asp 


Arg 


Met 


Phe 


Lys 


Phe 






435 










440 










445 


Thr 






VJl Li. 


Leu 


Thr 


Ly s 




Leu 


Glu 


His 


Asd 


Phe 


Gly 


Lys 


Val 


Lys 


Gin 




450 








455 










460 










Glu 


Val 


Lvs 


Asd 


Phe 


Phe 


Ara 


Trp 


Ala 


Ser 


Asp 


His 


Val 


Thr 


Glu 


Val 


465 






470 










475 










480 


Thr 


His 


Glu 


Phe 


Tyr 


Val 


Arg 


Lys 


Gly 


Gly 


Ala 


Arg 


Lys 


Arg 


Pro 


Ala 










485 










490 










495 




Pro 


Asn 


Asp 


Ala 


Asp 


He 


Ser 


Glu 


Pro 


Lys 


Arg 


Ala 


Cys 


Pro 


Ser 


Val 






500 








505 










510 






Ala 


Gin 


Pro 


Ser 


Thr 


Ser 


Asp 


Ala 


Glu 


Ala 


Pro 


Val 


Asp 


Tyr 


Ala 


Asp 






515 










520 










525 








Arg 


Leu 


Ala 


Arg 


Gly 


Gin 


Pro 


Leu 


Xaa 

















530 535 



SEQ ID NO: 11 

AAV 4 Rep protein 7 8 



Met 


Pro 


Gly 


Phe 


Tyr 


Glu 


lie 


vai 


Leu 


Lys 


vai 


Pro 


Ser 


Asp 


Leu 


Asp 


1 






5 










10 










15 




Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 


Asp 


Ser 


Phe 


Val 


Ser 


Trp 


Val 


Ala 


Glu 








2 0 








2 5 










30 




He 


Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 


Asp 


Ser 


Asp 




Asp 


Leu 


Asn 


Leu 




35 










40 


















Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


Ala 


Glu 


Lys 


Leu 


Gin 


Arg 


Glu 


Phe 


Leu 




50 










55 


















Val 


Val 


Glu 


Trp 


Arg 


Arg 


Val 


Ser 


Lys 


Ala 


Pro 


Glu 


ax a 


T an 
JjcU 


Phe 


Phe 


65 




7 O 




















80 


Gin 


Phe 


Glu 


Lys 


Gly 


Asp 


Ser 


Tyr 


Phe 


His 


Leu 


His 


He 


Leu 


Val 


Glu 








85 










90 










95 




Thr 


Val 


Gly 


Val 


Lys 


Ser 


Met 


Val 


Val 


Gly 


Arg 


Tyr 


Val 


Ser 


Gin 


He 






100 










105 










110 






Lys 


Glu 


Lys 


Leu 


Val 


Thr 


Arg 


He 


Tyr 


Arg 


Gly 


Val 


Glu 


Pro 


Gin 


Leu 




115 










120 










125 








Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys 


Thr 


Arg 


Asn 


Gly 


Ala 


Gly Gly Gly 




130 








135 










140 










Asn 


Lys 


Val 


Val 


Asp 


Asp 


Cys 


Tyr 


He 


Pro 


Asn 


Tyr 


Leu 


Leu 


Pro 


Lys 


145 








150 










155 










160 


Thr 


Gin 


Pro 


Glu 


Leu 


Gin 


Trp 


Ala 


Trp 


Thr 


Asn 


Met 


Asp 


Gin 


Tyr 


He 










165 










170 










175 


His 


Ser 


Ala 


Cys 


Leu 


Asn 


Leu 


Ala 


Glu 


Arg 


Lys 


Arg 


Leu 


Val 


Ala 


Gin 






180 










185 










190 






Leu 


Thr 


His 
195 


Val 


Ser 


Gin 


Thr 


Gin 
200 


Glu 


Gin 


Asn 


Lys 


Glu 
205 


Asn 


Gin 


Asn 


Pro 


Asn 


Ser 


Asp 


Ala 


Pro 


Val 


He 


Arg 


Ser 


Lys 


Thr 


Ser 


Ala 


Arg 


Tyr 




210 








215 










220 










Met 


Glu 


Leu 


Val 


Gly 


Trp 


Leu 


Val 


Asp 


Arg 


Gly 


He 


Thr 


Ser 


Glu 


Lys 


225 








230 










23 5 










240 


Gin 


Trp 


He 


Gin 


Glu 


Asp 


Gin 


Ala 


Ser 


Tyr 


He 


Ser 


Phe 


Asn 


Ala 


Ala 








245 


















255 




Ser 


Asn 


Ser 


Arg 
260 


Ser 


Gin 


He 


Lys 


Ala 
265 


Ala 


Leu 


Asp 


Asn 


Ala 
270 


Ser 


Lys 


He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Al a Rr_o_Asp 


Tyr 


Leu 


val 


Gly 


Gin 


Asn 






275 








280 










285 








Pro 


Pro 
290 


Glu 


Asp 


He 


Ser 


Ser 
295 


Asn 


Arg 


He 


Tyr 


Arg 
300 


He 


Leu 


Glu 


Met 


Asn 


Gly 


Tyr 


Asp 


Pro 


Gin 


Tyr 


Ala 


Ala 


Ser 


Val 


Phe 


Leu 


Gly 


Trp 


Ala 


305 




310 










315 










320 



85 
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Gin 


Lys 


Lys 


Phe 


Gly 


Lys 


Arg 


Asn 


Thr 


He 


Trp 


Leu 


Phe 


Gly 


Pro 


Ala 






325 










330 










335 




Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 


Glu 


Ala 


He 


Ala 


His 


Ala 


Val 


Pro 






340 










345 










350 






Phe 


Tyr 


Gly 


Cys 


val 


Asn 


Trp 


Thr 


Asn 


Glu 


Asn 


Phe 


Pro 


Phe 


Asn 


Asp 




355 










360 










365 








Cvs 


Val 


Asp 


LVS 


Met 


Val 


He 


Trp 


Trp 


Glu 


Glu 


Gly 


Lys 


Met 


Thr 


Ala 


370 










375 










380 










Lvs 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


He 


Leu 


Gly 


Gly 


Ser 


Lys 


Val 


Arg 


3 85 










390 










395 










400 


Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 


Ala 


Gin 


He 


Asp 


Pro 


Thr 


Pro 


Val 








405 










410 










415 




lie 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 


Cys 


Ala 


Val 


He 


Asp 


Gly 


Asn 


Ser 








42 0 










425 










430 






Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 


Leu 


Gin 


Asp 


Arg 


Met 


Phe 


Lys 


Phe 






4 3 5 










440 










445 








Glu 


Levi 


Thr 


Lvs 


Ara 


Leu 


Glu 


His 


Asp 


Phe 


Gly 


Lys 


Val 


Thr 


Lys 


Gin 




450 








455 










460 










Glu 


Val 


Lvs 


Asp 

It 


Phe 


Phe 


Arg 


Trp 


Ala 


Ser 


Asp 


His 


Val 


Thr 


Glu 


Val 


4 65 






470 










475 










480 


Thr 


His 


Glu 


Phe 


Tyr 


Val 


Arg 


Lys 


Gly 


Gly 


Ala 


Arg 


Lys 


Arg 


Pro 


Ala 










485 










490 










495 




Pro 


Asn 


Asp 


Ala 


Asp 


He 


Ser 


Glu 


Pro 


Lys 


Arg 


Ala 


Cys 


Pro 


Ser 


Val 






500 










505 










510 






Ala 


Gin 


Pro 


Ser 


Thr 


Ser 


Asp 


Ala 


Glu 


Ala 


Pro 


Val 


Asp 


Tyr 


Ala 


Asp 






515 










520 










525 








Ara 


Tvr 


Gin 


Asn 


Lvs 


Cys 


Ser 


Arg 


His 


Val 


Gly 


Met 


Asn 


Leu 


Met 


Leu 


53 0 










535 










540 










Phe 


Pro 


Cvs 


ArQ 


Gin 


Cys 


Glu 


Arg 


Met 


Asn 


Gin 


Asn 


Val 


Asp 


He 


Cys 


545 






550 










555 










560 


Phe 


Thr 


His 


Gly 


Val 


Met 


Asp 


Cys 


Ala 


Glu 


Cys 


Phe 


Pro 


Val 


Ser 


Glu 








565 










570 










575 




Ser 


Gin 


Pro 


Val 


Ser 


Val 


Val 


Arg 


Lys 


Arg 


Thr 


Tyr 


Gin 


Lys 


Leu 


Cys 








580 










585 










590 






Pro 


He 


His 


His 


He 


Met 


Gly 


Arg 


Ala 


Pro 


Glu 


Val 


Ala 


Cys 


Ser 


Ala 






595 










600 










605 








Cys 


Glu 


Leu 


Ala 


Asn 


val 


Asp 


Leu 


Asp 


Asp 


Cys 


Asp 


Met 


Glu 


Gin 





610 615 620 



SEQ ID NO: 12 
AAV4 Rep 40 gene 

atggagctgg tcgggtggct ggtggaccgc 
gaggaccagg cgtcctacat ctccttcaac 
gccgcgctgg acaatgcctc caaaatcatg 
gtgggccaga acccgccgga ggacatttcc 
aacgggtacg atccgcagta cgcggcctcc 
gggaagagga acaccatctg gctctttggg 
gaagccatcg cccacgccgt gcccttctac 
ccgttcaacg attgcgtcga caagatggtg 
aaggtcgtag agagcgccaa ggccatcctg 
tgcaagtcat cggcccagat cgacccaact 
tgcgcggtca tcgacggaaa ctcgaccacc 
atgttcaagt tcgagctcac caagcgcctg 
gaagtcaaag actttttccg gtgggcgtca 
tacgtcagaa agggtggagc tagaaagagg 
cccaagcggg cctgtccgtc agttgcgcag 
gactacgcgg acagattggc tagaggacaa 



AAV4 Rep 52 gene 
SEQ ID NOtl3 
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gggatcacgt cagaaaagca 
gccgcctcca actcgcggtc 
agcctgacaa agacggctcc 
agcaaccgca tctaccgaat 
gtcttcctgg gctgggcgca 
ccggccacga cgggtaaaac 
ggctgcgtga actggaccaa 
atctggtggg aggagggcaa 
ggcggaagca aggtgcgcgt 
cccgtgatcg tcacctccaa 
ttcgagcacc aacaaccact 
gagcacgact ttggcaaggt 
gatcacgtga ccgaggtgac 
cccgccccca atgacgcaga 
ccatcgacgt cagacgcgga 
cctctctga 



86 



atggatccag 60 

acaaatcaag 120 

ggactacctg 180 

cctcgagatg 240 

aaagaagttc 3 00 

caacatcgcg 3 60 

tgagaacttt 420 

gatgacggcc 480 

ggaccaaaag 540 

caccaacatg 600 

ccaggaccgg 660 

caccaagcag 720 

tcacgagttt 780 

tataagtgag 840 

agctccggtg 900 
939 
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atggagctgg tcgggtggct ggtggaccgc gggatcacgt cagaaaagca atggatccag 6 0 

gaggaccagg cgtcctacat ctccttcaac gccgcctcca actcgcggtc acaaatcaag 12 0 

gccgcgctgg acaatgcctc caaaatcatg agcctgacaa agacggctcc ggactacctg 180 

gtgggccaga acccgccgga ggacatttcc agcaaccgca tctaccgaat cctcgagatg 240 

aacgggtacg atccgcagta cgcggcctcc gtcttcctgg gctgggcgca aaagaagttc 300 

gggaagagga acaccatctg gctctttggg ccggccacga cgggtaaaac caacatcgcg 360 

gaagccatcg cccacgccgt gcccttctac ggctgcgtga actggaccaa tgagaacttt 420 

ccgttcaacg attgcgtcga caagatggtg atctggtggg aggagggcaa gatgacggcc 480 

aaggtcgtag agagcgccaa ggccatcctg ggcggaagca aggtgcgcgt ggaccaaaag 54 0 

tgcaagtcat cggcccagat cgacccaact cccgtgatcg tcacctccaa caccaacatg 600 

tgcgcggtca tcgacggaaa ctcgaccacc ttcgagcacc aacaaccact ccaggaccgg 660 

atgttcaagt tcgagctcac caagcgcctg gagcacgact ttggcaaggt caccaagcag 720 

gaagtcaaag actttttccg gtgggcgtca gatcacgtga ccgaggtgac tcacgagttt 780 

tacgtcagaa agggtggagc tagaaagagg cccgccccca atgacgcaga tataagtgag 84 0 

cccaagcggg cctgtccgtc agttgcgcag ccatcgacgt cagacgcgga agctccggtg 900 

gactacgcgg acaggtacca aaacaaatgt tctcgtcacg tgggtatgaa tctgatgctt 960 

tttccctgcc ggcaatgcga gagaatgaat cagaatgtgg acatttgctt cacgcacggg 1020 

gtcatggact gtgccgagtg cttccccgtg tcagaatctc aacccgtgtc tgtcgtcaga 1080 

aagcggacgt atcagaaact gtgtccgatt catcacatca tggggagggc gcccgaggtg 1140 

gcctgctcgg cctgcgaact ggccaatgtg gacttggatg actgtgacat ggaacaa 1197 



SEQ XD NO: 14 
AAV 4 Rep 6 8 gene 

atgccggggt tctacgagat cgtgctgaag gtgcccagcg acctggacga gcacctgccc 60 

ggcatttctg actcttttgt gagctgggtg gccgagaagg aatgggagct gccgccggat 120 

tctgacatgg acttgaatct gattgagcag gcacccctga ccgtggccga aaagctgcaa 180 

cgcgagttcc tggtcgagtg gcgccgcgtg agtaaggccc cggaggccct cttctttgtc 240 

cagttcgaga agggggacag ctacttccac ctgcacatcc tggtggagac cgtgggcgtc 300 

aaatccatgg tggtgggccg ctacgtgagc cagattaaag agaagctggt gacccgcatc 360 

taccgcgggg tcgagccgca gcttccgaac tggttcgcgg tgaccaagac gcgtaatggc 420 

gccggaggcg ggaacaaggt ggtggacgac tgctacatcc ccaactacct gctccccaag 480 

acccagcccg agctccagtg ggcgtggact aacatggacc agtatataag cgcctgtttg 540 

aatctcgcgg agcgtaaacg gctggtggcg cagcatctga cgcacgtgtc gcagacgcag 600 

gagcagaaca aggaaaacca gaaccccaat tctgacgcgc cggtcatcag gtcaaaaacc 660 

tccgccaggt acatggagct ggtcgggtgg ctggtggacc gcgggatcac gtcagaaaag 720 

caatggatcc aggaggacca ggcgtcctac atctccttca acgccgcctc caactcgcgg 780 

tcacaaatca aggccgcgct ggacaatgcc tccaaaatca tgagcctgac aaagacggct 84 0 

ccggactacc tggtgggcca gaacccgccg gaggacattt ccagcaaccg catctaccga 900 

atcctcgaga tgaacgggta cgatccgcag tacgcggcct ccgtcttcct gggctgggcg 960 

caaaagaagt tcgggaagag gaacaccatc tggctctttg ggccggccac gacgggtaaa 102 0 

accaacatcg cggaagccat cgcccacgcc gtgcccttct acggctgcgt gaactggacc 108 0 

aatgagaact ttccgttcaa cgattgcgtc gacaagatgg tgatctggtg ggaggagggc 1140 

aagatgacgg ccaaggtcgt agagagcgcc aaggccatcc tgggcggaag caaggtgcgc 1200 

gtggaccaaa agtgcaagtc atcggcccag atcgacccaa ctcccgtgat cgtcacctcc 1260 

aacaccaaca tgtgcgcggt catcgacgga aactcgacca ccttcgagca ccaacaacca 1320 

ctccaggacc ggatgttcaa gttcgagctc accaagcgcc tggagcacga ctttggcaag 1380 

gtcaccaagc aggaagtcaa agactttttc cggtgggcgt cagatcacgt gaccgaggtg 144 0 

actcacgagt tttacgtcag aaagggtgga gctagaaaga ggcccgcccc caatgacgca 1500 

gatataagtg agcccaagcg ggcctgtccg tcagttgcgc agccatcgac gtcagacgcg 1560 

gaagctccgg tggactacgc ggacagattg gctagaggac aacctctctg a 1611 



SEQ ID NO: 15 
AAV 4 Rep 7 8 gene 

atgccggggt tctacgagat cgtgctgaag gtgcccagcg acctggacga gcacctgccc 60 

ggcatttctg actcttttgt gagctgggtg gccgagaagg aatgggagct gccgccggat 12 0 

tctgacatgg acttgaatct gattgagcag gcacccctga ccgtggccga aaagctgcaa 180 

cgcgagttcc tggtcgagtg gcgccgcgtg agtaaggccc cggaggccct cttctttgtc 24 0 

cagttcgaga agggggacag ctacttccac ctgcacatcc tggtggagac cgtgggcgtc 300 

aaatccatgg tggtgggccg ctacgtgagc cagattaaag agaagctggt gacccgcatc 360 
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taccgcgggg tcgagccgca gcttccgaac 
gccggaggcg ggaacaaggt ggtggacgac 
acccagcccg agctccagtg ggcgtggact 
aatctcgcgg agcgtaaacg gctggtggcg 
gagcagaaca aggaaaacca gaaccccaat 
tccgccaggt acatggagct ggtcgggtgg 
caatggatcc aggaggacca ggcgtcctac 
tcacaaatca aggccgcgct ggacaatgcc 
ccggactacc tggtgggcca gaacccgccg 
atcctcgaga tgaacgggta cgatccgcag 
caaaagaagt tcgggaagag gaacaccatc 
accaacatcg cggaagccat cgcccacgcc 
aatgagaact ttccgttcaa cgattgcgtc 
aagatgacgg ccaaggtcgt agagagcgcc 
gtggaccaaa agtgcaagtc atcggcccag 
aacaccaaca tgtgcgcggt catcgacgga 
ctccaggacc ggatgttcaa gttcgagctc 
gtcaccaagc aggaagtcaa agactttttc 
actcacgagt tttacgtcag aaagggtgga 
gatataagtg agcccaagcg ggcctgtccg 
gaagctccgg tggactacgc ggacaggtac 
aatctgatgc tttttccctg ccggcaatgc 
ttcacgcacg gggtcatgga ctgtgccgag 
tctgtcgtca gaaagcggac gtatcagaaa 
gcgcccgagg tggcctgctc ggcctgcgaa 
atggaacaat aa 



SEQ ID NO: 16 

AAV 4 capsid protein VP 2 



Thr 


Ala 


Pro 


Gly 


Lys 


Lys 


Arg 


Pro 


Leu 


He 


Glu 


Ser 


Pro 


Gin 


Gin 


Pro 


1 






5 










10 










15 




Asp 


Ser 


Ser 


Thr 


Gly 


He 


Gly 


Lys 


Lys 


Gly 


Lys 


Gin 


Pro 


Ala 


Lys 


Lys 






20 










25 










30 






Lys 


Leu 


Val 


Phe 


Glu 


Asp 


Glu 


Thr 


Gly 


Ala 


Gly Asp 


Gly 


Pro 


Pro 


Glu 




35 










40 










45 








Gly 


Ser 


Thr 


Ser 


Gly 


Ala 


Met 


Ser 


Asp 


Asp 


Ser 


Glu 


Met 


Arg 


Ala 


Ala 


50 










55 










60 










Ala 


Gly 


Gly 


Ala 


Ala 


Val 


Glu 


Gly 


Gly 


Gin 


Gly Ala 


Asp 


Gly 


Val 


Gly 


65 






70 










75 










80 


Asn 


Ala 


Ser 


Gly 


Asp 


Trp 


His 


Cys 


Asp 


Ser 


Thr 


Trp 


Ser 


Glu 


Gly 


His 










85 










90 










95 




Val 


Thr 


Thr 


Thr 


Ser 


Thr 


Arg 


Thr 


Trp 


Val 


Leu 


Pro 


Thr 


Tyr 


Asn 


Asn 








100 










105 










110 






His 


Leu 


Tyr 


Lys 


Arg 


Leu 


Gly 


Glu 


Ser 


Leu 


Gin 


Ser 


Asn 


Thr 


Tyr 


Asn 






115 










120 










125 








Gly 


Phe 


Ser 


Thr 


Pro 


Trp 


Gly 


Tyr 


Phe 


Asp 


Phe 


Asn 


Arg 


Phe 


His 


Cys 


130 










135 










140 








Gly 


His 


Phe 


Ser 


Pro 


Arg 


Asp 


Trp 


Gin 


Arg 


Leu 


He 


Asn 


Asn 


Asn 


Trp 


14 5 








150 










155 










160 


Met 


Arg 


Pro 


Lys 


Ala 


Met 


Arg 


Val 


Lys 


He 


Phe 


Asn 


He 


Gin 


Val 


Lys 






165 










170 










175 




Glu 


Val 


Thr 


Thr 


Ser 


Asn 


Gly 


Glu 


Thr 


Thr 


Val 


Ala 


Asn 


Asn 


Leu 


Thr 








180 








185 










190 












Gin 


lie 




Ala 


Asp 


Ser 


Ser 


Tyr 


Glu 


Leu 


Pro 


Tyr 


Val 






195 










200 










205 








Met 


Asp 


Ala 


Gly 


Gin 


Glu 


Gly 


Ser 


Leu 


Pro 


Pro 


Phe 


Pro 


Asn 


Asp 


Val 




210 










215 










220 








Thr 


Phe 


Met 


Val 


Pro 


Gin 


Tyr 


Gly 


Tyr 


Cys 


Gly 


Leu 


Val 


Thr 


Gly 


Asn 


225 










230 








235 










240 


Ser 


Gin 


Gin 


Gin 


Thr 


Asp 


Arg 


Asn 


Ala 


Phe 


Tyr 


Cys 


Leu 


Glu 


Tyr 


Phe 










245 






250 










255 




Pro 


Ser 


Gin 


Met 


Leu 


Arg 


Thr 


Gly 


Asn 


Asn 


Phe 


Glu 


He 


Thr 


Tyr 


Ser 
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tggttcgcgg tgaccaagac 
tgctacatcc ccaactacct 
aacatggacc agtatataag 
cagcatctga cgcacgtgtc 
tctgacgcgc cggtcatcag 
ctggtggacc gcgggatcac 
atctccttca acgccgcctc 
tccaaaatca tgagcctgac 
gaggacattt ccagcaaccg 
tacgcggcct ccgtcttcct 
tggctctttg ggccggccac 
gtgcccttct acggctgcgt 
gacaagatgg tgatctggtg 
aaggccatcc tgggcggaag 
atcgacccaa ctcccgtgat 
aactcgacca ccttcgagca 
accaagcgcc tggagcacga 
cggtgggcgt cagatcacgt 
gctagaaaga ggcccgcccc 
tcagttgcgc agccatcgac 
caaaacaaat gttctcgtca 
gagagaatga atcagaatgt 
tgcttccccg tgtcagaatc 
ctgtgtccga ttcatcacat 
ctggccaatg tggacttgga 



gcgtaatggc 420 

gctccccaag 480 

cgcctgtttg 540 

gcagacgcag 600 

gtcaaaaacc 660 

gtcagaaaag 720 

caactcgcgg 7 80 

aaagacggct 840 

catctaccga 900 

gggctgggcg 960 

gacgggtaaa 1020 

gaactggacc 1080 

ggaggagggc 1140 

caaggtgcgc 12 00 

cgtcacctcc 1260 

ccaacaacca 1320 

ctttggcaag 13 80 

gaccgaggtg 14 4 0 

caatgacgca 1500 

gtcagacgcg 1560 

cgtgggtatg 1620 

ggacatttgc 1680 

tcaacccgtg 1740 

catggggagg 180 0 

tgactgtgac 1860 
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260 265 270 



Phe 


Glu 


Lys 
275 


Val 


Pro 


Phe 


His 


Ser 
280 


Met 


Tyr 


Ala 


His 


Ser 
285 


Gin 


Ser 


Leu 


Asp 




Leu 


Met 


Asn 


Pro 


Leu 


He 


Asp 


Gin 


Tyr 


Leu 


Trp 


Gly 


Leu 


Gin 


290 










295 










300 










Sg 2T 


Thr 


Thr 


Thr 


Glv 


Thr 


Thr 


Leu 


Asn 


Ala 


Gly 


Thr 


Ala 


Thr 


Thr 


Asn 


305 








310 










315 










320 


Phe 


Thr 


Lvs 


Leu 


Arg 


Pro 


Thr 


Asn 


Phe 


Ser 


Asn 


Phe 


Lys 


Lys 


Asn 


Trp 








325 










330 










335 




Leu 


Pro 


Glv 


Pro 


Ser 


He 


Lys 


Gin 


Gin 


Gly 


Phe 


Ser 


Lys 


Thr 


Ala 


Asn 






340 










345 










350 






Gin 


Asn 


A y *- 


Lys 


He 


Pro 


Ala 


Thr 


Gly 


Ser 


Asp 


Ser 


Leu 


He 


Lys 


Tyr 






355 








360 










365 








Glu 


Thr 
370 


His 


Ser 


Thr 


Leu 


ASD 
375 


Gly 

Jr 


Arq 


Trp 


Ser 


Ala 
380 


Leu 


Thr 


Pro 


Gly 


C X.\J 


Pro 


Met 


Ala 


Thr 


Ala 


Glv 


Pro 


Ala 


Asp 


Ser 


Lys 


Phe 


Ser 


Asn 


ser 


385 










390 










395 










400 


Gin 


Leu. 


He 


Phe 


Ala 
405 


Gly 


Pro 


Lys 


Gin 


Asn 
410 


Gly 


Asn 


Thr 


Ala 


Thr 
415 


Val 


Pro 


Glv 


Thr 


Leu 


He 


Phe 


Thr 


Ser 


Glu 


Glu 


Glu 


Leu 


Ala 


Ala 


Thr 


Asn 






420 










425 










430 






Ala 


Thr 


Asp 
435 


Thr 


Asp 


Met 


TrD 


Gly 
440 


Asn 


Leu 


Pro 


Gly 


Gly 
445 


Asp 


Gin 


Ser 


Asn 


Ser 
450 


Asn 


Leu 


Pro 


Thr 


val 
455 


ASP 


Arg 


Leu 


Thr 


Ala 
460 


Leu 


Gly 


Ala 


Val 


Pro 


Glv 


Met 


Val 


Trp 


Gin 


Asn 


Arcr 


Asp 


He 


Tyr 


Tyr 


Gin 


Gly 


Pro 


He 


465 






470 










475 










480 




Ala 


Lys 


He 


Pro 


His 


Thr 


Asp 


Gly 


His 


Phe 


His 


Pro 


Ser 


Pro 


Leu 






485 










490 










495 




He 


Gly 


Glv 


Phe 


Glv 


Leu 


Lvs 


His 


Pro 


Pro 


Pro 


Gin 


He 


Phe 


lie 


Lys 




500 










505 










510 






As n 


TVi r 

J. HJL 


Pro 
515 


v a. j. 


Pro 


Ala 


Asn 


Pro 
520 


Ala 


Thr 


Thr 


Phe 


Ser 
525 


Ser 


Thr 


Pro 


Val 


Asn 


Ser 


Phe 


He 


Thr 


Gin 


Tyr 


Ser 


Thr 


Gly 


Gin 


Val 


Ser 


val 


Gin 




530 










535 










540 








Glu 


He 


Asp 


Trp 


Glu 


He 


Gin 


Lys 


Glu 


Arg 


Ser 


Lys 


Arg 


Trp 


Asn 


Pro 


545 






550 










555 










560 


val 


Gin 


Phe 


Thr 


Ser 


Asn 


Tyr 


Gly 


Gin 


Gin 


Asn 


Ser 


Leu 


Leu 


Trp 


Ala 










565 








570 










575 




Pro 


Asp 


Ala 


Ala 


Gly 


Lys 


Tyr 


Thr 


Glu 


Pro 


Arg 


Ala 


He 


Gly 


Thr 


Arg 






580 










585 










590 






Tyr 


Leu 


Thr 


His 


His 


Leu 























595 



SEQ ID NO: 17 

AAV4 capsid protein VP 2 gene 



acggctcctg 
ggtatcggca 
ggagcaggcg 
atgcgtgcag 
aatgcctcgg 
agcaccagaa 
agcctgcagt 
cgcttccact 
atgcgaccca 
tcgaacggcg 
tcgtcgtacg 
cccaacgacg 
tcgcagcaac 
ctgcggactg 
atgtacgcgc 
tggggactgc 
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gaaagaagag 
aaaaaggcaa 
acggaccccc 
cagctggcgg 
gtgattggca 
cctgggtctt 
ccaacaccta 
gccacttc 
aagccatgcg 
agacaacggt 
aactgccg 
tctttatggt 
agactgacag 
gcaacaactt 
acagccagag 
aatcgaccac 



accgttgatt gaatcccccc agcagcccga ctcctccacg 60 

gcagccggct aaaaagaagc tcgttttcga agacgaaact 12 0 

tgagggatca acttccggag ccatgtctga tgacagtgag 18 0 

agctgcagtc gagggsggac aaggtgccga tggagtgggt 24 0 

ttgcgattcc acctggtctg agggccacgt cacgaccacc 3 00 

gcccacctac aacaaccacc tntacaagcg actcggagag 360 

caacggattc tccaccccct ggggatactt tgacttcaac 42 0 

rc ao e^^ ac- ^geagogac tcatcaacaa caactggggc 480 

ggtcaaaatc ttcaacatcc aggtcaagga ggtcacgacg 54 0 

ggctaataac cttaccagca cggttcagat ctttgcggac 600 

cgt g a *~gg at " gx^gggtcaag agggcagcct gcctcctttt 660 

gccccagtac ggctactgtg gactggtgac cggcaacact 720 

aaatgccttc tactgcctgg agtactttcc ttcgcagatg 780 

tgaaattacg tacagttttg agaaggtgcc tttccactcg 840 

cctggaccgg ctgatgaacc ctctcatcga ccagtacctg 900 

caccggaacc accctgaatg ccgggactgc caccaccaac 960 



89 



ATTORNEY DOCKET NO. 14014.0427U1 



tttaccaagc 
tcaatcaagc 
gggtcagaca 
ctgacccccg 
cagctcatct 
atcttcacct 
aacctacctg 
ttgggagccg 
tgggccaaga 
gggctgaaac 
gcaacgacct 
gtgtcggtgc 
gtccagttta 
gggaaataca 



tgcggcctac 
agcagggctt 
gtctcatcaa 
gacctccaat 
ttgcggggcc 
ctgaggagga 
gcggtgacca 
tgcctggaat 
ttcctcatac 
acccgcctcc 
tcagctccac 
agattgactg 
cctccaacta 
ctgagcctag 



caacttttcc 
ctcaaagact 
atacgagacg 
ggccacggct 
taaacagaac 
gctggcagcc 
gagcaacagc 
ggtctggcaa 
cgatggacac 
tcaaattttt 
tccggtaaac 
ggagatccag 
cggacagcaa 
ggctatcggt 



aactttaaaa 
gccaatcaaa 
cacagcactc 
ggacctgcgg 
ggcaacacgg 
accaacgcca 
aacctgccga 
aacagagaca 
tttcacccct 
atcaagaaca 
tccttcatta 
aaggagcggt 
aactctctgt 
acccgctacc 



agaactggct 
actacaagat 
tggacggaag 
acagcaagtt 
ccaccgtacc 
ccgatacgga 
ccgtggacag 
tttactacca 
caccgctgat 
ccccggtacc 
ctcagtacag 
ccaaacgctg 
tgtgggctcc 
tcacccacca 



gcccgggcct 
ccctgccacc 
atggagtgcc 
cagcaacagc 
cgggactctg 
catgtggggc 
actgacagcc 
gggtcccatt 
tggtgggttt 
tgcgaatcct 
cactggccag 
gaaccccgag 
cgatgcggct 
cctgtaataa 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



SEQ ID NO: 18 

AAV4 caps id protein VP 3 



Met 


Ser 


Asp 


Asp 


Ser 
5 


Glu 


Met 


Arg 


1 

Glu 


Gly 


Gly 


Gin 


Gly 


Ala 


Asp 


vjiy 








20 










His 


Cys 


Asp 


Ser 


Thr 


Trp 


Ser 


Glu 






35 










4 U 


Arg 


Thr 


Trp 


Val 


Leu 


Pro 


Thr 


Tyr 




50 










5 5 




Gly 


Glu 


Ser 


Leu 


Gin 


Ser 


Asn 


Thr 


65 










70 






Gly 


Tyr 


Phe 


Asp 


Phe 


Asn 


Arg 


Phe 










85 








Trp 


Gin 


Arg 


Leu 


He 


Asn 


Asn 


Asn 








100 










Arg 


Val 


Lys 


lie 


Phe 


Asn 


He 


Gin 






115 










12 0 


Gly 


Glu 


Thr 


Thr 


Val 


Ala 


Asn 


Asn 




130 










13 5 




Ala 


Asp 


Ser 


Ser 


Tyr 


Glu 


Leu 


Pro 


145 
















Gly 


Ser 


Leu 


Pro 


Pro 


Phe 


Pro 


Asn 








165 








Gly 


Tyr 


Cys 


Gly 


Leu 


Val 


Thr 


Gly 








180 










Arg 


Asn 


Ala 


Phe 


Tyr 


Cys 


Leu 


Glu 






195 










200 


Thr 


Gly 


Asn 


Asn 


Phe 


Glu 


He 


Thr 




210 










215 




His 


Ser 


Met 


Tyr 


Ala 


His 


Ser 


Gin 


225 










230 






Leu 


lie 


Asp 


Gin 


Tyr 


Leu 


Trp 


Gly 










245 








Thr 


Leu 


Asn 


Ala 


Gly 


Thr 


Ala 


Thr 








260 










Thr 


Asn 


Phe 


Ser 


Asn 


Phe 


Lys 


Lys 






275 










280 


Lys 


Gin 


Gin 


Gly 


Phe 


Ser 


Lys 


Thr 


290 










2 95 




Ala 


Thr 


Gly 


Ser 


Asp 


Ser 


Leu 


He 


305 










310 






Asp 


Gly 


Arg 


Trp 


Ser 


Ala 


Leu 


Thr 










325 








Gly 


Pro 


Ala 


Asp 


Ser 


Lys 


Phe 


Ser 








340 










Pro 


Lys 


Gin 


Asn 


Gly 


Asn 


Thr 


Ala 



233334_I7.DOC 



Aia 


M.X. cL 


7V "1 -a 


Vj-L y 




Ala 


Ala 


val 




10 










15 




Val 


vjj.y 


Asn 


Al a 


OCL 


Glv 


Asp 


Trn 












3 0 






o±y 


nlS 


Vdl 


X III. 


Thr 

X 11X 


Thr 

X 11X 


Ser 


Thr 








4 5 








Asn 


Asn 


nib 


Leu 


x yi 


Lys 


A ra 


Leu 








6 0 










Tyr 


Asn 






Ser 


Thr 


Pro 


Trn 




7 5 










80 


TT -J Q 

rlx. S 


Cys 


TJ -i d 

ni o 


rile 


Cpv- 


Pro 


Ara 


■ Asp 




an 










95 




Trp 




ne c 


Arg 


pro 


xjy 0 


Ala 


Met 


105 










110 






vai 


Lys 


vj±U 


vai 


X 11X7 


X XIX 


Q A r 


7\cn 

«.5>11 








X O 








Leu 


Tnr 


Ser 


x nr 


Vcl-L 




Tip 
lie 


Php 
Jriic 








i /in 
J- *± u 










Tyr 


vai 




Asp 


Ala 


VJ-L jr 


Gin 


Glu 




155 










160 


Asp 


Val 


Phe 


Met 


Val 


Pro 


Gin 


Tyr 


170 










175 




Asn 


Thr 


Ser 


Gin 


Gin 


Gin 


Thr 


Asp 


185 










190 






Tyr 


Phe 


Pro 


Ser 


Gin 


Met 


Leu 


Arg 








205 








Tyr 


Ser 


Phe 


Glu 


Lys 


Val 


Pro 


Phe 








220 










Ser 


Leu 


Asp 


Arg 


Leu 


Met 


Asn 


Pro 






235 










240 


Leu 


Gin 


Ser 


Thr 


Thr 


Thr 


Gly 


Thr 




250 










255 




Thr 


Asn 


Phe 


Thr 


Lys 


Leu 


Arg 


Pro 


265 










270 






Asn 


Trp 


Leu 


Pro 


Gly 


Pro 


Ser 


He 










285 








Ala 


Asn 


Gin 


Asn 


Tyr 


Lys 


He 


Pro 








300 










Lys 


Tyr 


Glu 


Thr 


His 


Ser 


Thr 


Leu 


315 










320 


Pro 


Gly 


Pro 


Pro 


Met 


Ala 


Thr 


Ala 




330 










335 




Asn 


Ser 


Gin 


Leu 


He 


Phe 


Ala 


Gly 


345 










350 






Thr 


Val 


Pro 


Gly 


Thr 


Leu 


He 


Phe 



90 
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3 5 5 










360 










365 


Thr 






Thr 


Ser 


Glu 


Glu 


Glu 


Leu 


Ala 


Ala 


Thr 


Asn 


Ala 


Thr 


Asp 


Asp 


Met 




^ f \J 










375 










380 








Thr 




Gly 


Asn 


Leu 


Pro 


Gly Gly 


Asp 


Gin 


Ser 


Asn 


Ser 


Asn . 


Leu 


Pro 




















395 










400 


val 


Asp 


Arq 


Leu 


Thr 


HI ct 


Leu 


Gly 


Ala 


Val 


Pro 


Gly 


Met 


val 


Trp 


Gin 






405 










410 










415 


His 


as n 




Asp 


lie 


Tvr 




Gin 


Glv 


Pro 


He 


Trp 


Ala 


Lys 


He 


Pro 






420 










425 










430 






Thr 


Asp 


Glv 


His 


Phe 


nib 


Pro 


Ser 


Pro 


Leu 


He 


Gly 


Gly 


Phe 


Gly 


Leu 




4 3 5 










44 0 










445 






Ala 


Lys 


His 


Pro 


Pro 


Pro 




lie 


Phe 


lie 


Lys 


Asn 


Thr 


Pro 


Val 


Pro 


4. n 










4. 5 5 










460 






He 


Thr 


Asn 


Pro 


Ala 


Thr 


Thr 


Phe 


Ser 


Ser 


Thr 


Pro 


Val 


Asn 


Ser 


Phe 












470 










475 








He 


480 


Gin 


Tvr 


Ser 


Thr 


Gly 


Gin 


Val 


Ser 


Val 


Gin 


He 


Asp 


Trp 


Glu 


Gin 








485 










490 










495 




Lys 


Glu 


Arg 


Ser 


Lys 


Arg 


Trp 


Asn 


Pro 


Glu 


Val 


Gin 


Phe 


Thr 


Ser 


Asn 






500 










505 










510 






Tyr 


Gly 


Gin 


Gin 


Asn 


Ser 


Leu 


Leu 


Trp 


Ala 


Pro 


Asp 


Ala 


Ala 


Gly 


Lys 


515 










520 










525 




His 




Tyr 


Thr 


Glu 


Pro 


Arg 


Ala 


lie 


Gly 


Thr 


Arg 


Tyr 


Leu 


Thr 


His 


Leu 


530 










535 










540 











SEQ ID NOrl9 

AAV4 capsid protein VP 3 gene 



atgcgtgcag 
aatgectegg 
agcaccagaa 
agectgeagt 
cgcttccact 
atgcgaccca 
tegaaeggeg 
tegtegtacg 
cccaacgacg 
tcgcagcaac 
ctgcggactg 
atgtacgcgc 
tggggactgc 

tttaccaagc 
tcaatcaagc 
gggtcagaca 
ctgacccccg 
cagctcatct 
atcttcacct 
aacctacctg 
ttgggagccg 
tgggecaaga 
gggctgaaac 
gcaacgacct 
gtgtcggtgc 
gtccagttta 
gggaaataca 



cagctggegg 
gtgattggca 
cctgggtctt 
ccaacaccta 
gccacttctc 
aagccatgcg 
agacaaeggt 
aactgeegta 
tctttatggt 
agactgacag 
gcaacaactt 
acagecagag 
aatcgaccac 
tgcggcctac 
agcagggctt 
gtctcatcaa 
gacctccaat 
ttgeggggee 
ctgaggagga 
gcggtgacca 
tgcctggaat 
ttcctcatac 
acccgcctcc 
tcagctctac 
agattgactg 
cctccaacta 
ctgagectag 



agetgeagtc 
ttgegattec 
gcccacctac 
caaeggatte 
accacgtgac 
ggtcaaaatc 
ggctaataac 
cgtgatggat 
gccccagtac 
aaatgectte 
tgaaattacg 
cctggaccgg 
caccggaacc 
caacttttcc 
ctcaaagact 
atacgagacg 
ggccacggct 
taaacagaac 
gctggcagcc 
gagcaacagc 
ggtctggcaa 
cgatggacac 
tcaaattttt 
teeggtaaac 
ggagatccag 
eggacagcaa 
ggctatcggt 



gagggsggac 
acctggtctg 
aacaaccacc 
tccaccccct 
tggcagegae 
ttcaacatcc 
cttaccagca 
gcgggtcaag 
ggctactgtg 
tactgectgg 
tacagttttg 
ctgatgaacc 
accctgaatg 
aactttaaaa 
gecaatcaaa 
cacagcactc 
ggacctgegg 
ggcaacaegg 
accaacgcca 
aacctgccga 
aacagagaca 
tttcacccct 
atcaagaaca 
tccttcatta 
aaggagcggt 
aactctctgt 
acccgctacc 



aaggtgccga 
agggecaegt 
tntacaagcg 
ggggatactt 
tcatcaacaa 
aggtcaagga 
eggttcagat 
agggcagect 
gactggtgac 
agtactttcc 
agaaggtgee 
ctctcatcga 
ccgggactgc 
agaactggct 
actacaagat 
tggacggaag 
acagcaagtt 
ccaccgtacc 
ccgatacgga 
ccgtggacag 
tttactacca 
caccgctgat 
ccccggtacc 
ctcagtacag 
ccaaacgctg 
tgtgggctcc 
tcacccacca 



tggagtgggt 
cacgaccacc 
acteggagag 
tgacttcaac 
caactggggc 
ggtcacgacg 
etttgeggae 
gcctcctttt 
cggcaacact 
ttcgcagatg 
tttccactcg 
ccagtacctg 
caccaccaac 
gcccgggcct 
ccctgccacc 
atggagtgcc 
cagcaacagc 
egggactctg 
catgtggggc 
actgacagcc 
gggtcccatt 
tggtgggttt 
tgegaatect 
cactggccag 
gaaccccgag 
egatgegget 
cctgtaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1617 



SEQ ID NO: 20 

AAV 4 ITR "flop" orientatioa- 

ttggccactc cctctatgcg cgctcgctca 
gtctggagac ctttggtgtc cagggcaggg 
agtggccaa 



ctcactcggc cctgcggcca gaggceggea 
ccgagtgagt gagegagege gcatagaggg 



91 
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SEQ ID NO: 21 

TCTAGTCTAG ACTTGGCCAC TCCCTCTCTG CGCGC 



35 



SEQ ID NO: 22 
AGGCCTTAAG AGCAGTCGTC CACCACCTTG TTCC 



34 



SEQ ID NO: 23 

tggcactctc 

caaagagctg 

gcgaacgcga 

tgtcataatg 

tgttttatcc 

aagaccgagt 

gccatggcta 

cctggaattt 

gagtcagatt 

cgccgcgtgt 

cagtttgaaa 

tcttccatgg 

ttccagggaa 

ggagccaata 

ccggagcttc 

gaggagcgca 

gcggcttcgc 

tacatggcgc 

caggaaaatc 

aaggccgcgc 

ctcgtgggga 

atgaatggct 

ttcaacaaga 

gcggaggcca 

tttcccttta 

aacaaggtgg 

aaatgtaaat 

atgtgtgtgg 

cgcatgttca 

caggaagtca 

tttaaagttc 

ctgggtgacg 

gttcccgaga 

aattggaatt 

aacaaatgtg 

gtaactcact 

tttggggatt, 

gttgatcacc 

cttgaagcgg 

cttgtgctgc 

gtcaacaggg 

gcgggagaca 

gccgacgaca 

gttctcgaac 

atagacgacc 

acctcgtcag 

ccagcctcaa 

aataaccaag 

tggatggggg 

aaccaccagt 

tttggataca 
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ccccctgtcg 
ccagacgacg 
caggggggag 
atgtaatgct 
aataggaaga 
gaacgagccc 
ccttctatga 
ctgacagcfct 
taaatttgac 
tcctgtacga 
agggatctga 
tcctcggccg 
ttgaacccca 
aggtggtgga 
agtgggcgtg 
aacggctcgt 
agcgtgagtt 
tcgtcaactg 
aggagagcta 
tcgacaacgc 
gctccgttcc 
acgacccggc 
ggaacaccgt 
tcgcccacac 
atgactgtgt 
ttgaatccgc 
cctctgttca 
tggtggatgg 
aatttgaact 
aggacttttt 
ccagggaatt 
tcaccaatac 
cgcctcgcag 
caaggtatga 
atgaatgtga 
gtcaaatttg 
ttgacgatgc 
ctccagattg 
gcccaccgaa 
ctggttataa 
cagacgaggt 
acccctacct 
catccttcgg 
cttttggcct 
actttccaaa 
acgccgaagc 
gtttgggagc 
gtgccgatgg 
acagagtcgt 
accgagagat 
gcaccccctg 



cgttcgctcg 

gccctctggc 

agtgccacac 

tattgtcacg 

aagcgcgcgt 

gccgccattc 

agtcattgtt 

tgtggactgg 

tctggttgaa 

gtggaacaaa 

atattttcat 

ctacgtgagt 

gatcaacgac 

ttctgggtat 

gacaaacctg 

cgcgcagttt 

ctcggctgac 

gctcgtggag 

cctctccttc 

gaccaaaatt 

cgaggacatt 

ctacgcggga 

ctggctctac 

tgtgcccttt 

ggacaaaatg 

caaggccatc 

aattgattct 

gaattccacg 

gactaagcgg 

tgcttgggca 

ggcgggaact 

tagctataaa 

ttcagacgtg 

ttgcaaatgt 

atatttgaat 

tcatgggatt 

caataaagaa 

gttggaagaa 

accaaaaccc 

ctatctcgga 

cgcgcgagag 

caagtacaac 

gggaaacctc 

ggttgaagag 

aagaaagaag 

tggacccagc 

tgatacaatg 

agtgggcaat 

caccaagtcc 

caaaagcggc 

ggggtacttt 



ctcgctggct 

cgtcgccccc 

tctcaagcaa 

cgatagttaa 

atgagttctc 

tttgctctgg 

cgcgtcccat 

gtaactggtc 

cagcctcagt 

ttttccaagc 

ctgcacacgc 

cagattcgcg 

tgggtcgcca 

attcccgcct 

gacgagtata 

ctggcagaat 

ccggtcatca 

cacggcatca 

aactccaccg 

atgagtctga 

tcaaaaaaca 

tccatcctct 

ggacccgcca 

tacggctgcg 

ctcatttggt 

ctggggggct 

acccctgtca 

acctttgaac 

ctcccgccag 

aaggtcaatc 

aaaggggcgg 

agtctggaga 

actgttgatc 

gactatcatg 

cggggcaaaa 

cccccctggg 

cagtaaataa 

gttggtgaag 

aatcagcagc 

cccggaaacg 

cacgacatct 

cacgcggacg 

ggaaaggcag 

ggtgctaaga 

gctcggaccg 

ggatcccagc 

tctgcgggag 

gcctcgggag 

acccgaacct 

tccgtcgacg 

gactttaacc 

92 



cgtttggggg 

ccaaacgagc 

gggggttttg 

tgattaacag 

gcgagacttc 

actgctagag 

ttgacgtgga 

aaatttggga 

tgacggtggc 

aggagtccaa 

ttgtggagac 

cccagctggt 

tcaccaaggt 

acctgctgcc 

aattggccgc 

cctcgcagcg 

aaagcaagac 

cttccgagaa 

gcaactctcg 

caaaaagcgc 

gaatctggca 

acggctggtg 

cgaccggcaa 

tgaactggac 

gggaggaggg 

caaaggtgcg 

ttgtaacttc 

accagcagcc 

attttggcaa 

aggtgccggt 

agaaatctct 

agcgggccag 

ccgctcctct 

ctcaatttga 

atggatgtat 

aaaaggaaaa 

agcgagtagt 

gtcttcgcga 

atcaagatca 

gtctcgatcg 

cgtacaacga 

ccgagtttca 

tctttcaggc 

cggcccctac 

aagaggactc 

agctgcaaat 

gtggcggccc 

attggcattg 

gggtgctgcc 

gaagcaacgc 

gcttccacag 



ggtggcagct 60 

cagcgagcga 120 

taagcagtga 180 

tcatgtgatg 240 

cggggtataa 300 

gaccctcgct 360 

ggaacatctg 420 

gctgcctcca 4 80 

tgatagaatt 540 

attctttgtg 600 

ctccggcatc 660 

gaaagtggtc 72 0 

aaagaagggc 7 80 

gaaggtccaa 84 0 

cctgaatctg 900 

ctcgcaggag 960 

ttcccagaaa 1020 

gcagtggatc 1080 

gagccagatc 1140 

ggtggactac 1200 

aatttttgag 1260 

tcagcgctcc 1320 

gaccaacatc 1380 

caatgaaaac 1440 

aaagatgacc 15 0 0 

ggtcgatcag 1560 

caatacaaac 1620 

gctggaggac 1680 

gattactaag 1740 

gactcacgag 1800 

aaaacgccca 1860 

gctctcattt 1920 

gcgaccgctc 1980 

caacatttct 2040 

ctgtcacaat 2100 

cttgtcagat 2160 

catgtctttt 2220 

gtttttgggc 2280 

agcccgtggt 2340 

aggagagcct 24 00 

gcagcttgag 2460 

ggagaagctc 2520 

caagaaaagg 25 80 

cggaaagcgg 2 64 0 

caagccttcc 2700 

cccagcccaa 2760 

attgggcgac 2 82 0 

cgattccacg 2880 

cagctacaac 2 94 0 

caacgcctac 3000 

ccactggagc 3 06 0 
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ccccgagact 
gtcaaaatct 
gccaacaacc 
gtcgtcggca 
ccgcagtacg 
agcttcttct 
tttacctaca 
ttcaagctgg 
actggcggag 
tggttcccgg 
gccagtgtca 
gtgcccccgc 
gagaacacta 
gagggcaaca 
aacgtcggcg 
acgtacaacc 
caaggaccca 
atgggcggat 
cccggaaata 
accgggcagg 
aacccagaga 
gacagcaccg 
ctttaaccca 
gcctcttgtg 
gagtgtggca 
cagctcaaag 
agcgagcgaa 



ggcaaagact 
tcaacattca 
tcacctccac 
acgggaccga 
gttacgcgac 
gcctagagta 
actttgagga 
ccaacccgct 
tccagttcaa 
ggcccatggg 
gcgccttcgc 
agccgaacgg 
tgatcttcaa 
tgctcatcac 
ggcagatggc 
tccaggaaat 
tctgggccaa 
tcggactcaa 
tcaccagctt 
tcaccgtgga 
tccagtacac 
gggaatacag 
ttcatgtcgc 
gtcattcaat 
ctctcccccc 
agctgccaga 
cgcgacaggg 



catcaacaac 
agtcaaagag 
cgtccaagtg 
gggatgcctg 
gctgaaccgc 
ctttcccagc 
ggtgcccttc 
ggtggaccag 
caagaacctg 
ccgaacccag 
cacgaccaat 
catgaccaac 
cagccagccg 
cagcgagagc 
caccaacaac 
cgtgcccggc 
gatcccagag 
acacccaccg 
ctcggacgtg 
gatggagtgg 
aaacaactac 
aaccaccaga 
ataccctcaa 
gaataacagc 
tgtcgcgttc 
cgacggccct 
gsgagagtgc 



tactggggct 
gtcacggtgc 
tttacggacg 
ccggccttcc 
gacaacacag 
aagatgctga 
cactccagct 
tacttgtacc 
gccgggagat 
ggctggaacc 
aggatggagc 
aacctccagg 
gcgaacccgg 
gagacgcagc 
cagagctcca 
agcgtgtgga 
acgggggcgc 
cccatgatgc 
cccgtcagca 
gagctcaaga 
aacgaccccc 
cctatcggaa 
taaaccgtgt 
ttacaacatc 
gctcgctcgc 
ctggccgtcg 
ca 



tcagaccccg 
aggactccac 
acgactacca 
ctccgcaggt 
aaaatcccac 
gaacgggcaa 
tcgctcccag 
gcttcgtgag 
acgccaacac 
tgggctccgg 
tcgagggcgc 
gcagcaacac 
gcaccaccgc 
cggtgaaccg 
ccactgcccc 
tggagaggga 
actttcaccc 
tcatcaagaa 
gcttcatcac 
aggaaaactc 
agtttgtgga 
cccgatacct 
attcgtgtca 
tacaaaacct 
tggctcgttt 
cccccccaaa 



gtccctcaga 
caccaccatc 
gctgccctac 
ctttacgctg 
cgagaggagc 
caactttgag 
tcagaacctg 
cacaaataac 
ctacaaaaac 
ggtcaaccgc 
gagttaccag 
ctatgccctg 
cacgtacctc 
cgtggcgtac 
cgcgaccggc 
cgtgtacctc 
ctctccggcc 
cacgcctgtg 
ccagtacagc 
caagaggtgg 
ctttgccccg 
tacccgaccc 
gtaaaatact 
ccttgcttga 

gggggggtgg 

cgagccagcg 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4652 



SEQ ID NO: 24 



Met 


Ala 


Leu 


Val 


Asn 


Trp 


Leu 


Val 


Glu 


His 


Gly 


He 


Thr 


Ser 


Glu 


Lys 


1 








5 








10 










15 




Gin 


Trp 


He 


Gin 


Glu 


Asn 


Gin 


Glu 


Ser 


Tyr 


Leu 


Ser 


Phe 


Asn 


Ser 


Thr 






20 










25 










30 






Gly 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys 


Ala 


Ala 


Leu 


Asp 


Asn 


Ala 


Thr 


Lys 




35 










40 










45 








lie 


Met 
50 


Ser 


Leu 


Thr 


Lys 


Ser 
55 


Ala 


Val 


Asp 


Tyr 


Leu 
60 


Val 


Gly 


Ser 


Ser 


Val 


Pro 


Glu 


Asp 


lie 


Ser 


Lys 


Asn 


Arg 


He 


Trp 


Gin 


He 


Phe 


Glu 


Met 


65 








70 










75 










80 


Asn 


Gly 


Tyr 


Asp 


Pro 


Ala 


Tyr 


Ala 


Gly 


Ser 


He 


Leu 


Tyr 


Gly 


Trp 


Cys 




85 










90 










95 




Gin 


Arg 


Ser 


Phe 


Asn 


Lys 


Arg 


Asn 


Thr 


Val 


Trp 


Leu 


Tyr 


Gly 


Pro 


Ala 






100 










105 










110 






Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 


Glu 


Ala 


He 


Ala 


His 


Thr 


Val 


Pro 






115 








120 










125 








Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 


Asn 


Glu 


Asn 


Phe 


Pro 


Phe 


Asn 


Asp 




130 






135 










140 










Cys 


Val 


Asp 


Lys 


Met 


Leu 


He 


Trp 


Trp 


Glu 


Glu 


Gly 


Lys 


Met 


Thr 


Asn 


145 






150 










155 








Val 


160 


Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


He 


Leu 


Gly 


Gly 


Ser 


Lys 


Arg 








165 










170 










175 




Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 


val 


Gin 


He 


Asp 


Ser 


Thr 


Pro 


Val 






180 










185 










190 






He 


Val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met: 


Cys 


Val 


Val 


Val 


Asp 


Gly 


Asn 


Ser 






195 










200 










205 






Phe 


Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 


Leu 


Glu 


Asp 


Arg 


Met 


Phe 


Lys 




210 










215 










220 








Gin 


Glu 


Leu 


Thr 


Lys 


Arg 


Leu 


Pro 


Pro 


Asp 


Phe 


Gly 


Lys 


He 


Thr 


Lys 


225 






230 










235 










240 


Glu 


val 


.Lys 


Asp 


Phe 


Phe 


Ala 


Trp 


Ala 


Lys 


val 


Asn 


Gin 


Val 


Pro 


Val 






245 










250 










255 




Thr 


His 


Glu 


Phe 


Lys 


Val 


Pro 


Arg 


Glu 


Leu 


Ala 


Gly 


Thr 


Lys 


Gly 


Ala 
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260 



Glu 


Lys 


Ser 


Leu 


Lys 


Arg 


Pro 


Leu 






275 












Lys 


Ser 


Leu 


Glu 


Lys 


Arg 


Ala 


Arg 




2 90 














Arg 


Ser 


Ser 


Asp 


vai 


Tnr 


vai 


Asp 


i r\ c 
















Trp 


Asn 


Ser 


Arg 


Tyr 


Asp 


Cys 


Lys 










325 








Asn 


Ti- 
ne 


Ser 


Asn 


Lys 


Cys 


Asp 










340 










Asn 


Gly 


Cys 


lie 


Cys 


nlS 


Asn 


vai 






355 










360 


lie 


Pro 


Pro 


Trp 


Glu 


Lys 


Glu 


Asn 




370 










375 




Asp 


Ala 


Asn 


Lys 


G1U 


tin 






385 
















SEQ ID NO: 25 












Met 


Ala 


Thr 


Phe 


Tyr 


Glu 


vai 


lie 


1 








c 








Glu 


His 


Leu 


Pro 


Gly 


lie 


Ser 


TV a wv 

Asp 








2 0 










Gin 


lie 


Trp 


Glu 


Leu 


Pro 


Pro 


Glu 






35 












Glu 


Gin 


Pro 


Gin 


Leu 


Thr 


Val 


TV 1 _ 

Ala 




50 










55 




Tyr 


Glu 


Trp 


Asn 


Lys 


Pne 


Ser 


Lys 


65 










/ u 






Phe 


Glu 


Lys 


Gly 


Ser 


Glu 


Tyr 


pne 


















Ser 


Gly 


lie 


Ser 


Ser 


Met 


Val 


Leu 








100 










Ala 


Gin 


Leu 


Val 


Lys 


Val 


Val 


Phe 






115 










12 0 


Asp 


Trp 


Val 


Ala 


lie 


Thr 


Lys 


Val 




130 










13 5 




Val 


Asp 


Ser 


Gly 


Tyr 


lie 


Pro 


Ala 


145 










15 0 






Glu 


Leu 


Gin 


Trp 


Ala 


Trp 


Thr 


Asn 










165 








Leu 


Asn 


Leu 


Glu 


Glu 


Arg 


Lys 


Arg 








180 










Ser 


Ser 


Gin 


Arg 


Ser 


Gin 


Glu 


Ala 






195 










*■> f\ r\ 


Asp 


Pro 


Val 


lie 


Lys 


Ser 


Lys 


Thr 




210 














Asn 


Trp 


Leu 


Val 


Glu 


His 


Gly 


lie 


225 










2 3 0 






Glu 


Asn 


Gin 


Glu 


Ser 


Tyr 


Leu 


Ser 










24 5 








Ser 


Gin 


lie 


Lys 


Ala 


Ala 


Leu 


Asp 








260 










Thr 


Lys 


Ser 


Ala 


Val 


Asp 


Tyr 


Leu 






275 










280 


lie 


Ser 


Lys 


Asn 


Arg 


lie 


Trp 


Gin 




290 










295 




Pro 


Ala 


Tyr 


Ala 


Gly 


Ser 


lie 


Leu 


305 










310 






Asn 


Lys 


Arg 


Asn 


Thr 


Val 


Trp 


Leu 










325 
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265 










270 






Gly Asp 


val 


Thr 


Asn 


Thr 


Ser 


Tyr 










285 








Leu 




Phe 


Val 


Pro 


Glu 


Thr 


Pro 








300 










Pro 


Ala 


Pro 


Leu 


Arq 


Pro 


Leu 


Asn 






315 










320 


Cys 


Asp 


Tyr 


His 


Ala 


Gin 


Phe 


Asp 




330 










335 




Cys 


Glu 


Tyr 


Leu 


Asn 


Arg 


Gly 


Lys 


345 










350 






Thr 


His 


Cys 


Gin 


He 


Cys 


His 


Gly 










365 








Leu 


Ser 


Asp 


Phe 


Gly Asp 


Phe 


Asp 



380 



val 


Arg 


val 


Pro 


trl ICS 


Q, on 


Val 


Glu 




10 










J- 3 




Ser 


Phe 


Vd -L 






Val 


Thr 


Glv 


25 
















Ser 


Asp 


Leu 


Asn 


Leu 


l nr 


Leu 


Val 
vai 


















Asp 


Arg 


x le 


Arg 


Arg 


Vdl 


rue 


eu 








oO 










Gin 


Glu 


Cor 

oer 


Lys 


-flit; 


DVip 


vai 


Gin 






75 










ft n 


His 


Leu 


fll s 


Thr 
J. in 


Leu 


val 


Glu 


Thr 




90 














Gly 


Arg 


Tyr 


V ct _L 


Q ^> -v-~ 


Gin 


He 




105 










Tin 






Gin Gly 


He 


Glu 


Pro 


Gin 


He 


Asn 










125 








Lys 


Lys 


Gly 


Gly 


Ala 


Asn 


Lys 


Val 








140 










Tyr 


Leu 


Leu 


Pro 


Lys 


Val 


Gin 


Pro 




155 










160 


Leu 


Asp 


Glu 


Tyr 


Lys 


Leu 


Ala 


Ala 




170 










175 




Leu 


Val 


Ala 


Gin 


Phe 


Leu 


Ala 


Glu 


185 










190 






Ala 


Ser 


Gin 


Arg 


Glu 


Phe 


Ser 


Ala 










205 








Ser 


Gin 


Lys 


Tyr 


Met 


Ala 


Leu 


Val 








220 










Thr 


Ser 


Glu 


Lys 


Gin 


Trp 


He 


Gin 






235 










240 


Phe 


Asn 


Ser 


Thr 


Gly 


Asn 


Ser 


Arg 




250 










255 




Asn 


Ala 


Thr 


Lys 


He 


Met 


Ser 


Leu 












270 






Val 


Gly 


Ser 


Ser 


Val 


Pro 


Glu 


Asp 








285 








He 


Phe 


Glu 


Met 


Asn 


Gly 


Tyr 


Asp 








30O 










Tyr 


Gly 


Trp 


Cys 


Gin 


Arg 


Ser 


Phe 






315 










320 


Tyr 


Gly 


Pro 


Ala 


Thr 


Thr 


Gly 


Lys 




330 










335 
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Thr 


Asn 


lie 


Ala 


Glu 


Ala 


He 


Ala 


His 


Thr 


Val 


Pro 


Phe Tyr Gly Cys 








340 










345 










350 






Val 


Asn 


Trp 


Thr 


Asn 


Glu 


Asn 


Phe 


Pro 


Phe 


Asn 


Asp 


Cvs 


Val 


Asp 


Lys 






355 










360 










365 






Glu 


Met 


Leu 


He 


Trp 


Trp 


Glu 


Glu 


Gly 


Lys 


Met 


Thr 


Asn 


Lvs 


Val 


Val 




370 








375 










380 










Ser 


Ala 


Lys 


Ala 


He 


Leu 


Gly Gly 


Ser 


Lys 


Val 


Arg 


Val 


Asp 


Gin 


Lys 


385 








390 










395 










400 


Cys 


Lys 


Ser 


Ser 


val 


Gin 


lie 


ASp 


Ser 


Thr 


Pro 


Val 


He 


Val 


Thr 


Ser 






405 










410 










415 




Asn 


Thr 


Asn 


Met 
420 


Cys 


Val 


Veil 


Veil 


Asp 

425 


Gly 


Asn 


Ser 


Thr 


Thr 
430 


Phe 


Glu 


His 


Gin 


Gin 
435 


Pro 


Leu 


Glu 


ASp 


Arg 
44 0 


Met 


Phe 


Lys 


Phe 


Glu 
445 


Leu 


Thr 


Lys 


Arg 


Leu 


Pro 


Pro 


Asp 


Phe 




Lys 


He 


Thr 


Lys 


Gin 


Glu 


Val 


Lys 


Asp 


450 










455 










460 




His 




Phe 


Phe 


Phe 


Ala 


Trp 


Ala 


Lys 


vai 


Asn 


Gin 


val 


Pro 


Val 


Thr 


Glu 


465 








470 










475 










480 


Lys 


Val 


Pro 


Arg 


Glu 


Leu 


Ala 

Aia 


<jiy 


Thr 


Lys 


Gly 


Ala 


Glu 


Lvs 


Ser 


Leu 








485 










490 










495 




Lys 


Arg 


Pro 


Leu 


Gly 


Asp 


Val 

VAX 


Thr 


Asn 


Thr 


Ser 


Tyr 


Lys 


Ser 


Leu 


Glu 




500 










505 










510 






Lys 


Arg 


Ala 


Arg 


Leu 


Ser 


i I1C 


Val 


Pro 


Glu 


Thr 


Pro 


Ara 


Ser 


Ser 


Asp 

w tr 


515 










520 










525 








Val 


Thr 


val 


Asp 


Pro 


Ala 


Pro 


Leu 


Arg 


Pro 


Leu 


Asn 


Trp 


Asn 


Ser 


Arg 




530 








535 










540 










Tyr 


Asp 


Cys 


Lys 


Cys 


Asp 


Tyr 


His 


Ala 


Gin 


Phe 


Asp 


Asn 


He 


Ser 


Asn 


545 








550 










555 










560 


Lys 


Cys 


Asp 


Glu 


Cys 


Glu 


Tyr 


Leu 


Asn 


Arg 


Gly 


Lys 


Asn 


Gly 


Cys 


He 






565 










570 










575 




Cys 


His 


Asn 


Val 


Thr 


His 


Cys 


Gin 


He 


Cys 


His 


Gly 


He 


Pro 


Pro 


Trp 






580 










585 










590 






Glu 


Lys 


Glu 


Asn 


Leu 


Ser 


Asp 


Phe 


Gly 


Asp 


Phe 


Asp 


Asp 


Ala 


Asn 


Lys 




595 










600 










605 








Glu 


Gin 
610 































SEQ ID NO: 2 6 



Met 


Ser 


Phe 


Val 


Asp 


His 


Pro 


Pro 


Asp 


Trp 


Leu 


Glu 


Glu 


Val 


Gly 


Glu 


1 








5 










10 










15 




Gly 


Leu 


Arg 


Glu 


Phe 


Leu 


Gly 


Leu 


GlU 


Ala 


Gly 


Pro 


Pro 


Lys 


Pro 


Lys 




20 










25 










30 






Pro 


Asn 


Gin 


Gin 


His 


Gin 


Asp 


Gin 


Ala 


Arg 


Gly 


Leu 


Val 


Leu 


Pro 


Gly 






35 










40 










45 






Val 


Tyr 


Asn 


Tyr 


Leu 


Gly 


Pro 


Gly 


Asn 


Gly 


Leu 


Asp 


Arg 


Gly 


Glu 


Pro 


50 








55 










60 








Glu 


Asn 


Arg 


Ala 


Asp 


Glu 


Val 


Ala 


Arg 


Glu 


His 


Asp 


He 


Ser 


Tyr 


Asn 


65 






70 










75 








Ala 


80 


Gin 


Leu 


Glu 


Ala 


Gly 


Asp 


Asn 


Pro 


Tyr 


Leu 


Lys 


Tyr 


Asn 


His 


Asp 










85 








90 










95 




Ala 


Glu 


Phe 


Gin 


Glu 


Lys 


Leu 


Ala 


Asp 


Asp 


Thr 


Ser 


Phe 


Gly 


Gly 


Asn 








100 








105 










110 






Leu 


Gly 


Lys 


Ala 


Val 


Phe" 


"Gin 


Ala 


Lys 


Lys 


Arg 


Val 


Leu 


Glu 


Pro 


Phe 




115 










120 










125 






He 


Gly 


Leu 


Val 


Glu 


Glu 








Pro 




Gly 


Lys 


Arg 


130 










135 










140 










Asp 


Asp 


His 


Phe 


Pro 


Lys 


Arg 


Lys 


Lys 


Ala 


Arg 


Thr 


Glu 


Glu 


Asp 


Ser 


145 








150 










155 










160 


Lys 


Pro 


Ser 


Thr 


Ser 


Ser 


Asp 


Ala 


Glu 


Ala 


Gly 


Pro 


Ser 


Gly 


Ser 


Gin 








165 








170 










175 




Gin 


Leu 


Gin 


He 


Pro 


Ala 


Gin 


Pro 


Ala 


Ser 


Ser 


Leu 


Gly Ala 


Asp 


Thr 








180 










185 










190 






Met 


Ser 


Ala 


Gly Gly 


Gly Gly 


Pro 


Leu 


Gly 


Asp 


Asn 


Asn 


Gin 


Gly 


Ala 
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195 

Asp Gly val Gly 
210 

Met Gly Asp Arg 
225 

Ser Tyr Asn Asn 

Gly Ser Asn Ala 
260 

Phe Asp Phe Asn 
275 

Arg Leu lie Asn 
290 

Lys lie Phe Asn 
305 

Thr Thr lie Ala 

Asp Asp Tyr Gin 
340 

Leu Pro Ala Phe 
355 

Ala Thr Leu Asn 
370 

Phe Phe Cys Leu 
385 

Asn Phe Glu Phe 

Phe Ala Pro Ser 
420 

Gin Tyr Leu Tyr 
435 

Phe Asn Lys Asn 
450 

Phe Pro Gly Pro 
465 

Val Asn Arg Ala 

Leu Glu Gly Ala 
500 

Asn Asn Leu Gin 
515 

Phe Asn Ser Gin 
530 

Gly Asn Met Leu 
545 

Val Ala Tyr Asn 

Thr Thr Ala Pro 
580 

Gly Ser Val Trp 
595 

Ala Lys lie Pro 
610 

Gly Gly Phe Gly 
625 

Thr Pro Val Pro 

Ser Phe lie Thr 
660 

Trp Glu Leu Lys 
675 

Tyr Thr Asn Asn 
6 90 

Ser Thr Gly Glu 
705 
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200 

Asn Ala Ser Gly 
215 

Val Val Thr Lys 
230 

His Gin Tyr Arg 
245 

Asn Ala Tyr Phe 

Arg Phe His Ser 
280 

Asn Tyr Trp Gly 
295 

He Gin Val Lys 
310 

Asn Asn Leu Thr 
325 

Leu Pro Tyr Val 

Pro Pro Gin Val 
360 

Arg Asp Asn Thr 

375 

Glu Tyr Phe Pro 
390 

Thr Tyr Asn Phe 
405 

Gin Asn Leu Phe 

Arg Phe Val Ser 
440 

Leu Ala Gly Arg 

455 

Met Gly Arg Thr 
470 

Ser Val Ser Ala 
485 

Ser Tyr Gin Val 

Gly Ser Asn Thr 
52 0 

Pro Ala Asn Pro 
535 

He Thr Ser Glu 
550 

Val Gly Gly Gin 
565 

Ala Thr Gly Thr 

Met Glu Arg Asp 
600 

Glu Thr Gly Ala 
615 

Leu Lys His Pro 
630 

Gly Asn He Thr 
645 

Gin Tyr Ser Thr 

Lys Glu Asn Ser 
680 

Tyr Asn Asp Pro 
695 

Tyr Arg Thr Thr 
710 



Asp Trp His Cys 
220 

Ser Thr Arg Thr 
235 

Glu He Lys Ser 
250 

Gly Tyr Ser Thr 
265 

His Trp Ser Pro 

Phe Arg Pro Arg 
300 

Glu Val Thr Val 
315 

Ser Thr Val Gin 
330 

Val Gly Asn Gly 
345 

Phe Thr Leu Pro 

Glu Asn Pro Thr 
380 

Ser Lys Met Leu 
395 

Glu Glu Val Pro 
410 

Lys Leu Ala Asn 
425 

Thr Asn Asn Thr 

Tyr Ala Asn Thr 
460 

Gin Gly Trp Asn 
475 

Phe Ala Thr Thr 
490 

Pro Pro Gin Pro 
505 

Tyr Ala Leu Glu 

Gly Thr Thr Ala 
540 

Ser Glu Thr Gin 
555 

Met Ala Thr Asn 
570 

Tyr Asn Leu Gin 
585 

Val Tyr Leu Gin 

His Phe His Pro 
620 

Pro Pro Met Met 
635 

Ser Phe Ser Asp 
650 

Gly Gin Val Thr 
665 

Lys Arg Trp Asn 

Gin Phe Val Asp 
700 

Arg Pro lie Gly 

715 

96 



205 

Asp Ser Thr Trp 

Trp Val Leu Pro 
24 0 

Gly Ser Val Asp 
255 

Pro Trp Gly Tyr 
270 

Arg Asp Trp Gin 
285 

Ser Leu Arg val 

Gin Asp Ser Thr 
320 

Val Phe Thr Asp 

' 335 

Thr Glu Gly Cys 
350 

Gin Tyr Gly Tyr 
365 

Glu Arg Ser Ser 
Arg Thr Gly Asn 

400 

Phe His Ser Ser 
415 

Pro Leu Val Asp 
430 

Gly Gly Val Gin 
445 

Tyr Lys Asn Trp 

Leu Gly Ser Gly 
480 

Asn Arg Met Glu 
495 

Asn Gly Met Thr 
510 

Asn Thr Met He 
525 

Thr Tyr Leu Glu 

Pro Val Asn Arg 
560 

Asn Gin Ser Ser 
575 

Glu He Val Pro 
590 

Gly Pro He Trp 
605 

Ser Pro Ala Met 

Leu lie Lys Asn 
640 

val Pro Val Ser 
655 

Val Glu Met Glu 
670 

Pro Glu He Gin 
685 

Phe Ala Pro Asp 

Thr Arg Tyr Leu 
720 
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Thr Arg Pro Leu 



SEQ ID NO: 27 

Thr Ala Pro Thr Gly Lys Arg lie Asp Asp His Phe Pro Lys Arg Lys 

15 10 15 

Lys Ala Arg Thr Glu Glu Asp Ser Lys Pro Ser Thr Ser Ser Asp Ala 

20 25 30 

Glu Ala Gly Pro Ser Gly Ser Gin Gin Leu Gin lie Pro Ala Gin Pro 

35 40 45 

Ala Ser Ser Leu Gly Ala Asp Thr Met Ser Ala Gly Gly Gly Gly Pro 

50 55 60 

Leu Gly Asp Asn Asn Gin Gly Ala Asp Gly Val Gly Asn Ala Ser Gly 
65 70 75 80 

Asp Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val Val Thr Lys 

85 90 95 

Ser Thr Arg Thr Trp Val Leu Pro Ser Tyr Asn Asn His Gin Tyr Arg 

100 105 HO 

Glu lie Lys Ser Gly Ser Val Asp Gly Ser Asn Ala Asn Ala Tyr Phe 

115 120 125 

Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Ser 

130 135 140 

His Trp Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Tyr Trp Gly 
145 150 155 160 

Phe Arg Pro Arg Ser Leu Arg Val Lys lie Phe Asn lie Gin Val Lys 

165 170 175 

Glu Val Thr Val Gin Asp Ser Thr Thr Thr lie Ala Asn Asn Leu Thr 

180 185 190 

Ser Thr Val Gin Val Phe Thr Asp Asp Asp Tyr Gin Leu Pro Tyr Val 

195 200 205 

Val Gly Asn Gly Thr Glu Gly Cys Leu Pro Ala Phe Pro Pro Gin Val 

210 215 220 

Phe Thr Leu Pro Gin Tyr Gly Tyr Ala Thr Leu Asn Arg Asp Asn Thr 
225 230 235 240 

Glu Asn Pro Thr Glu Arg Ser Ser Phe Phe Cys Leu Glu Tyr Phe Pro 

245 250 255 

Ser Lys Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Thr Tyr Asn Phe 

260 265 270 

Glu Glu Val Pro Phe His Ser Ser Phe Ala Pro Ser Gin Asn Leu Phe 

275 280 285 

Lys Leu Ala Asn Pro Leu Val Asp Gin Tyr Leu Tyr Arg Phe Val Ser 

290 295 300 

Thr Asn Asn Thr Gly Gly Val Gin Phe Asn Lys Asn Leu Ala Gly Arg 
305 310 315 320 

Tyr Ala Asn Thr Tyr Lys Asn Trp Phe Pro Gly Pro Met Gly Arg Thr 

325 330 335 

Gin Gly Trp Asn Leu Gly Ser Gly Val Asn Arg Ala Ser Val Ser Ala 

340 345 350 

Phe Ala Thr Thr Asn Arg Met Glu Leu Glu Gly Ala Ser Tyr Gin Val 

355 360 365 

Pro Pro Gin Pro Asn Gly Met Thr Asn Asn Leu Gin Gly Ser Asn Thr 

370 375 380 

Tyr Ala Leu Glu Asn Thr Wet lie "Phe Asn Ser Gin Pro Ala Asn Pro 
385 390 395 400 

Gly Thr Thr Ala Thr Tyr CI- 32^- Met Leu lie Thr Ser Glu 

405 410 415 

Ser Glu Thr Gin Pro Val Asn Arg Val Ala Tyr Asn Val Gly Gly Gin 

420 425 430 

Met Ala Thr Asn Asn Gin Ser Ser Thr Thr Ala Pro Ala Thr Gly Thr 

435 440 445 

Tyr Asn Leu Gin Glu lie Val Pro Gly Ser Val Trp Met Glu Arg Asp 

450 455 460 

Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro Glu Thr Gly Ala 
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His 


Phe 


His 


Pro 


IT X. \J 




Met 


Met 








500 


Ser 


Phe 


Ser 


Asp 






515 




JLy 


bin 


Vai. 


i ill. 




530 






Lys 


Arg 


Trp 


Asn 


545 








Gin 


Phe 


Val 


Asp 


Arg 


Pro 


He 


Gly 








580 





470 






Ser 


Pro 


Ala 


Met 


40D 








LlC Li. 


Tie 


Lys 


Asn 


V Cl J. 


Pro 


Val 


Ser 








52 0 


Val 


Glu 


Met 


Glu 






535 




Pro 


Glu 


He 


Gin 




550 






Phe 


Ala 


Pro 


Asp 


565 








Thr 


Arg 


Tyr 


Leu 



475 

Gly Gly Phe Gly 
490 

Thr Pro Val Pro 
505 

Ser Phe He Thr 

Trp Glu Leu Lys 
540 

Tyr Thr Asn Asn 
555 

Ser Thr Gly Glu 
570 

Thr Arg Pro Leu 
565 



480 

Leu Lys His Pro 
495 

Gly Asn lie Thr 
510 

Gin Tyr Ser Thr 
525 

Lys Glu Asn Ser 

Tyr Asn Asp Pro 
560 

Tyr Arg Thr Thr 
575 



SEQ ID NO: 28 



Met 


Ser 


Ala 


Lily 


1 

Asp 


Gly 


val 


vjiy 








£ U 


Met 


G±y 


Asp 


Arg 






3 5 




Ser 


Tyr 


Asn 


Asn 




50 






Gly 


Ser 


Asn 


Ala 


65 








Pne 


ASp 


nu A 

pne 


Asn 


Arg 


Leu 


lie 


Asn 








1 Aft 
10O 


Lys 


Tl _ 

I xe 


T-> V-l 

pne 


Asn 






115 




Thr 


Thr 


He 


Ala 




130 






Asp 


Asp 


Tyr 


Gin 


14 5 








Leu 


Pro 


Ala 


Pne 


Ala 


Thr 


Leu 


Asn 








180 


Phe 


Phe 


Cys 


Leu 






195 




Asn 


Phe 


Glu 


Phe 




210 






Phe 


Ala 


Pro 


Ser 


225 








Gin 


Tyr 


Leu 


Tyr 


Phe 


Asn 


Lys 


Asn 








260 


Phe 


Pro 


Gly 


Pro 






275 




val 


Asn 


Arg 


Ala 




290 






Leu 


Glu 


Gly 


Ala 


305 








Asn 


Asn 


Leu 


Gin 


Phe 


Asn 


Ser 


Gin 








340 


Gly 


Asn 


Met 


Leu 






355 





La J.y 






Pro 


b 








Asn 


TV "1 


Q /a "V- 


Gly 


va 1 


v ax 


inr 


Lys 










nib 


vjj.11 


i-yr 












Asn 


Ala 


Tyr 


Phe 




70 






Arg 


Phe 


His 


0 >- 


8 5 








Asn 


Tyr 


Trp 


wly 


lie 


Gin 


Val 


Lys 










Asn 


Asn 


Leu 


1 XIX. 






135 




Leu 


Pro 


Tyr 


Val 




150 






Pro 


Pro 


Gin 


vai 


165 








Arg 


Asp 


Asn 


Thr 


Glu 


Tyr 


Phe 


Pro 








200 


Thr 


Tyr 


Asn 


Phe 






215 




Gin 


Asn 


Leu 


Phe 




230 






Arg 


Phe 


Val 


Ser 


245 








Leu 


Ala 


Gly 


Arg 


Met 


Gly Arg 


Thr 








280 


Ser 


Val 


Ser 


Ala 






295 




Ser 


Tyr 


Gin 


Val 




310 






Gly 


Ser 


Asn 


Thr 


325 








Pro 


Ala 


Asn 


Pro 


He 


Thr 


Ser 


Glu 








360 



Leu Gly Asp Asn 
10 

Asp Trp His Cys 
25 

Ser Thr Arg Thr 

Glu He Lys Ser 
60 

Gly Tyr Ser Thr 
75 

His Trp Ser Pro 
90 

Phe Arg Pro Arg 
105 

Glu Val Thr Val 

Ser Thr Val Gin 
140 

Val Gly Asn Gly 
155 

Phe Thr Leu Pro 
170 

Glu Asn Pro Thr 
185 

Ser Lys Met Leu 

Glu Glu Val Pro 
220 

Lys Leu Ala Asn 
235 

Thr Asn Asn Thr 
250 

Tyr Ala Asn Thr 
265 

Gin Gly Trp Asn 

Phe Ala Thr Thr 
300 

Pro Pro Gin Pro 
315 

Tyr Ala Leu Glu 
330 

Gly Thr Thr Ala 
345 

Ser Glu Thr Gin 



Asn Gin Gly Ala 
15 

Asp Ser Thr Trp 
30 

Trp Val Leu Pro 
45 

Gly Ser Val Asp 

Pro Trp Gly Tyr 
80 

Arg Asp Trp Gin 
95 

Ser Leu Arg Val 
110 

Gin Asp Ser Thr 
125 

Val Phe Thr Asp 

Thr Glu Gly Cys 
160 

Gin Tyr Gly Tyr 
175 

Glu Arg Ser Ser 
190 

Arg Thr Gly Asn 
205 

Phe His Ser Ser 

Pro Leu Val Asp 
240 

Gly Gly Val Gin 
255 

Tyr Lys Asn Trp 
270 

Leu Gly Ser Gly 
285 

Asn Arg Met Glu 

Asn Gly Met Thr 
320 

Asn Thr Met He 
335 

Thr Tyr Leu Glu 
350 

Pro Val Asn Arg 
365 



98 

233334_17.DOC 



ATTORNEY DOCKET NO. 14014.0427U1 



IT- T 

Vai 


A J. a 


Tyr 


Asn 




37 0 






mr 


i. XIX 


rtJ. cat 


Pro 


tod 

3B5 










Ser 


vdl 


Trp 


Ala 


Lys 


Tie 

lie 


Pro 








a o n 

ft z u 


Gly 


Gly 


Phe 


Gly 






435 




Thr 


Pro 


vai 


Pro 




450 






Ser 


Pne 


lie 


rpl_ ~_ 

i nr 


465 








Trp 


vjIU 


Leu 


Lys 


Tyr 


Thr 


Asn 


Asn 






500 


Ser 


Thr 


Gly 


Glu 






515 




Thr 


Arg 


Pro 


Leu 




530 







Val 


Gly 


Gly 


Gin 






375 




Ala 


Thr 


GlV 


Thr 




3 90 






Met 


Glu 


Arcr 


Asd 


4 0 5 








Glu 


Thr 


Glv 


Ala 


Leu 


Lys 


His 


Pro 








44 0 


Gly 


Asn 


He 


Thr 






455 




Gin 


TVr 


Ser 


Thr 




*x / \J 






Lys 


Glu 


Asn 


Ser 


485 








Tyr 


Asn 


Asp 


Pro 


Tyr 


Arg 


Thr 


Thr 








520 



Met 


Ala 


Thr 


Asn 








380 


Tyr 


Asn 


Leu 


Gin 






395 




Val 


Tvr 


Leu 


Gin 




410 






His 


Phe 


His 


Pro 


425 








Pro 


Pro 


Met 


Met 


Ser 


Phe 


Ser 


ASO 








460 


Gly 


Gin 


Val 


Thr 






475 




Lys 


Arg 


Trp 


Asn 




490 






Gin 


Phe 


Val 


Asp 


505 








Arg 


Pro 


He 


Gly 



Asn 


Gin 


Ser 


Ser 


Glu 


He 


Val 


Pro 








400 


Gly 


Pro 


He 


Trp 






415 




Ser 


Pro 


Ala 


Met 




430 






Leu 


lie 


Lys 


Asn 


445 








Val 


Pro 


Val 


Ser 


Val 


Glu 


Met 


Glu 








480 


Pro 


Glu 


He 


Gin 






495 




Phe 


Ala 


Pro 


ASp 




510 






Thr 


Arg 


Tyr 


Leu 


525 









SEQ ID NO: 29 



aggctctcat 
ctgcgaccgc 
accctccaga 
cgggcccacc 
tgcctggt ta 
gggcagacga 
acaaccccta 
acacatcctt 
aaccttttgg 
accactttcc 
cagacgccga 
caagtttggg 
aaggtgccga 
gggacagagt 
agtaccgaga 
acagcacccc 
actggcaaag 
tcttcaacat 
acctcacctc 
gcaacgggac 
acggttacgc 
tctgcctaga 
acaactttga 
tggccaaccc 
gagtccagtt 
cggggcccat 
tcagcgcctt 
cgcagccgaa 
ctatgatctt 
acatgctcat 
gcgggcagat 
acctccagga 
ccatctgggc 
gattcggact 
atatcaccag 
aggtcaccgt 
agatccagta 
ccggggaata 
ccattcatgt 
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ttgttcccga 
tcaattggaa 
ttggttggaa 
gaaaccaaaa 
taactatctc 
ggtcgcgcga 
cctcaagtac 
cgggggaaac 
cctggttgaa 
aaaaagaaag 
agctggaccc 
agctgataca 
tggagtgggc 
cgtcaccaag 
gatcaaaagc 
ctgggggtac 
actcatcaac 
tcaagtcaaa 
caccgtccaa 
cgagggatgc 
gacgctgaac 
gtactttccc 
ggaggtgccc 
gctggtggac 
caacaagaac 
gggccgaacc 
cgccacgacc 
cggcatgacc 
caacagccag 
caccagcgag 
ggccaccaac 
aatcgtgccc 
caagatccca 
caaacaccca 
cttctcggac 
ggagatggag 
cacaaacaac 
cagaaccacc 
cgcataccct 



gacgcctcgc 
ttcaagtaaa 
gaagttggtg 
cccaatcagc 
ggacccggaa 
gagcacgaca 
aaccacgcgg 
ctcggaaagg 
gagggtgcta 
aaggctcgga 
agcggatccc 
atgtctgcgg 
aatgcctcgg 
tccacccgaa 
ggctccgtcg 
tttgacttta 
aactactggg 
gaggtcacgg 
gtgtttacgg 
ctgccggcct 
cgcgacaaca 
agcaagatgc 
ttccactcca 
cagtacttgt 
ctggccggga 
cagggctgga 
aataggatgg 
aacaacctcc 
ccggcgaacc 
agcgagacgc 
aaccagagct 
ggcagcgtgt 
gagacggggg 
ccgcccatga 
gtgcccgtca 
tgggagctca 
tacaacgacc 
agacctatcg 
caataaa 



agttcagacg 
taaagcgagt 
aaggtcttcg 
agcatcaaga 
acggtctcga 
tctcgtacaa 
acgccgagtt 
cagtctttca 
agacggcccc 
ccgaagagga 
agcagctgca 
gaggtggcgg 
gagattggca 
cctgggtgct 
acggaagcaa 
accgcttcca 
gcttcagacc 
tgcaggactc 
acgacgacta 
tccctccgca 
cagaaaatcc 
tgagaacggg 
gcttcgctcc 
accgcttcgt 
gatacgccaa 
acctgggctc 
agctcgaggg 

agggcagcaa 

cgggcaccac 
agccggtgaa 
crraccactgc 
ggatggagag 
cgcactttca 
tgctcatcaa 
gcagcttcat 
agaaggaaaa 
cccagtttgt 
gaacccgata 
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tgactgttga 
agtcatgtct 
cgagtttttg 
tcaagcccgt 
tcgaggagag 
cgagcagctt 
tcaggagaag 
ggccaagaaa 
taccggaaag 
ctccaagcct 
aatcccagcc 
cccattgggc 
ttgcgattcc 
gcccagctac 
cgccaacgcc 
cagccactgg 
ccggtccctc 
caccaccacc 
ccagctgccc 
ggtctttacg 
caccgagagg 
caacaacttt 
cagtcagaac 
gagcacaaat 
cacctacaaa 
cggggtcaac 
cgcgagttac 
cacctatgcc 
cgccacgtac 
ccgcgtggcg 
ccccgcgacc 
ggacgtgtac 
cccctctccg 
gaacacgcct 
cacccagtac 
ctccaagagg 
ggactttgcc 
ccttacccga 



tcccgctcct 
tttgttgatc 
ggccttgaag 
ggtcttgtgc 
cctgtcaaca 
gaggcgggag 
ctcgccgacg 
agggttctcg 
cggatagacg 
tccacctcgt 
caaccagcct 
gacaataacc 
acgtggatgg 
aacaaccacc 
tactttggat 
agcccccgag 
agagtcaaaa 
atcgccaaca 
tacgtcgtcg 
ctgccgcagt 
agcagcttct 
gagtttacct 
ctgttcaagc 
aacactggcg 
aactggttcc 
cgcgccagtg 
caggtgcccc 
ctggagaaca 
ctcgagggca 
tacaacgtcg 
ggcacgtaca 
ctccaaggac 
gccatgggcg 
gtgcccggaa 
agcaccgggc 
tggaacccag 
ccggacagca 
cccctttaac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2307 
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SEQ XD NO: 30 

aggctctcat ttgttcccga gacgcctcgc agttcagacg tgactgttga tcccgctcct 60 

ctgcgaccgc tcaattggaa ttcaagattg gttggaagaa gttggtgaag gtcttcgcga 120 

gtttttgggc cttgaagcgg gcccaccgaa accaaaaccc aatcagcagc atcaagatca 180 

agcccgtggt cttgtgctgc ctggttataa ctatctcgga cccggaaacg gtctcgatcg 240 

aggagagcct gtcaacaggg cagacgaggt cgcgcgagag cacgacatct cgtacaacga 3 00 

gcagcttgag gcgggagaca acccctacct caagtacaac cacgcggacg ccgagtttca 360 

ggagaagctc gccgacgaca catccttcgg gggaaacctc ggaaaggcag cctttcaggc 42 0 

caagaaaagg gttctcgaac cttttggcct ggttgaagag ggtgctaaga cggcccctac 480 

cggaaagcgg atagacgacc actttccaaa aagaaagaag gctcggaccg aagaggactc 54 0 

caagccttcc acctcgtcag acgccgaagc tggacccagc ggatcccagc agctgcaaat 600 

cccagcccaa ccagcctcaa gtttgggagc tgatacaatg tctgcgggag gtggcggccc 660 

attgggcgac aataaccaag gtgccgatgg agtgggcaat gcctcgggag attggcattg 72 0 

cgattccacg tggatggggg acagagtcgt caccaagtcc acccgaacct gggtgctgcc 780 

cagctacaac aaccaccagt accgagagat caaaagcggc tccgtcgacg gaagcaacgc 84 0 

caacgcctac tttggataca gcaccccctg ggggtacttt gactttaacc gcttccacag 900 

ccactggagc ccccgagact ggcaaagact catcaacaac tactggggct tcagaccccg 960 

gtccctcaga gtcaaaatct tcaacattca agtcaaagag gtcacggtgc aggactccac 1020 

caccaccatc gccaacaacc tcacctccac cgtccaagtg tttacggacg acgactacca 1080 

gctgccctac gtcgtcggca acgggaccga gggatgcctg ccggccttcc ctccgcaggt 114 0 

ctttacgctg ccgcagtacg gttacgcgac gctgaaccgc gacaacacag aaaatcccac 12 00 

cgagaggagc agcttcttct gcctagagta ctttcccagc aagatgctga gaacgggcaa 1260 

caactttgag tttacctaca actttgagga ggtgcccttc cactccagct tcgctcccag 1320 

tcagaacctg ttcaagctgg ccaacccgct ggtggaccag tacttgtacc gcttcgtgag 13 80 

cacaaataac actggcggag tccagttcaa caagaacctg gccgggagat acgccaacac 1440 

ctacaaaaac tggttcccgg ggcccatggg ccgaacccag ggctggaacc tgggctccgg 1500 

ggtcaaccgc gccagtgtca gcgccttcgc cacgaccaat aggatggagc tcgagggcgc 1560 

gagttaccag gtgcccccgc agccgaacgg catgaccaac aacctccagg gcagcaacac 162 0 

ctatgccctg gagaacacta tgatcttcaa cagccagccg gcgaacccgg gcaccaccgc 1680 

cacgtacctc gagggcaaca tgctcatcac cagcgagagc gagacgcagc cggtgaaccg 174 0 

cgtggcgtac aacgtcggcg ggcagatggc caccaacaac cagagctcca ccactgcccc 1800 

cgcgaccggc acgtacaacc tccaggaaat cgtgcccggc agcgtgtgga tggagaggga 1860 

cgtgtacctc caaggaccca tctgggccaa gatcccagag acgggggcgc actttcaccc 1920 

ctctccggcc atgggcggat tcggactcaa acacccaccg cccatgatgc tcatcaagaa 1980 

cacgcctgtg cccggaaata tcaccagctt ctcggacgtg cccgtcagca gcttcatcac 2040 

ccagtacagc accgggcagg tcaccgtgga gatggagtgg gagctcaaga aggaaaactc 2100 

caagaggtgg aacccagaga tccagtacac aaacaactac aacgaccccc agtttgtgga 2160 

ctttgccccg gacagcaccg gggaatacag aaccaccaga cctatcggaa cccgatacct 2220 

tacccgaccc ctttaaccca ttcatgtcgc ataccctcaa taaa 2264 



SEQ ID NO: 31 

aggctctcat ttgttcccga gacgcctcgc agttcagacg tgactgttga tcccgctcct 60 

ctgcgaccgc tcaattggaa ttcaagattg gttggaagaa gttggtgaag gtcttcgcga 12 0 

gtttttgggc cttgaagcgg gcccaccgaa accaaaaccc aatcagcagc atcaagatca 180 

agcccgtggt cttgtgctgc ctggttataa ctatctcgga cccggaaacg gtctcgatcg 240 

aggagagcct gtcaacaggg cagacgaggt cgcgcgagag cacgacatct cgtacaacga 3 00 

gcagcttgag gcgggagaca acccctacct caagtacaac cacgcggacg ccgagtttca 3 60 

ggagaagctc gccgacgaca catccttcgg gggaaacctc ggaaaggcag tctttcaggc 42 0 

caagaaaagg gttctcgaac cttttggcct ggttgaagag ggtgctaaga cggcccctac 480 

cggaaagcgg atagacgacc actttccaaa aagaaagaag gctcggaccg aagaggactc 54 0 

A^r.ia acctcgtcag ac gc c ga agc tggacccagc ggatcccagc agctgcaaat 6 CO 

cccagcccaa ccagcctcaa gtttgggagc tgatacaatg tctgcgggag gtggcggccc 660 

attgggcgac aataaccaag gtgccgatgg agtgggcaat gcctcgggag attggcattg 720 

cga_t.tccacg tggatggggg acagagtcgt caccaagtcc acccgaacct gggtgctgcc 7 80 

cagctacaac aaccaccagt accgagagat caaaagcggc tccgtcgacg gaagcaacgc 84 O 

caacgcctac tttggataca gcaccccctg ggggtacttt gactttaacc gcttccacag 900 

ccactggagc ccccgagact ggcaaagact catcaacaac tactggggct tcagaccccg 960 

gtccctcaga gtcaaaatct tcaacattca agtcaaagag gtcacggtgc aggactccac 1020 

caccaccatc gccaacaacc tcacctccac cgtccaagtg tttacggacg acgactacca 1080 

100 
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gctgccctac gtcgtcggca acgggaccga gggatgcctg ccggccttcc ctccgcaggt 1140 

ctttacgctg ccgcagtacg gttacgcgac gctgaaccgc gacaacacag aaaatcccac 1200 

cgagaggagc agcttcttct gcctagagta ctttcccagc aagatgctga gaacgggcaa 1260 

caactttgag tttacctaca actttgagga ggtgcccttc cactccagct tcgctcccag 1320 

tcagaacctg ttcaagctgg ccaacccgct ggtggaccag tacttgtacc gcttcgtgag 1380 

cacaaataac actggcggag tccagttcaa caagaacctg gccgggagat acgccaacac 1440 

ctacaaaaac tggttcccgg ggcccatggg ccgaacccag ggctggaacc tgggctccgg 1500 

ggtcaaccgc gccagtgtca gcgccttcgc cacgaccaat aggatggagc tcgagggcgc 1560 

gagttaccag gtgcccccgc agccgaacgg catgaccaac aacctccagg gcagcaacac 1620 

ctatgccctg gagaacacta tgatcttcaa cagccagccg gcgaacccgg gcaccaccgc 1680 

cacgtacctc gagggcaaca tgctcatcac cagcgagagc gagacgcagc cggtgaaccg 1740 

cgtggcgtac aacgtcggcg ggcagatggc caccaacaac cagagctcca ccactgcccc 1800 

cgcgaccggc acgtacaacc tccaggaaat cgtgcccggc agcgtgtgga tggagaggga 1860 

cgtgtacctc caaggaccca tctgggccaa gatcccagag acgggggcgc actttcaccc 1920 

ctctccggcc atgggcggat tcggactcaa acacccaccg cccatgatgc tcatcaagaa 1980 

cacgcctgtg cccggaaata tcaccagctt ctcggacgtg cccgtcagca gcttcatcac 2040 

ccagtacagc accgggcagg tcaccgtgga gatggagtgg gagctcaaga aggaaaactc 2100 

caagaggtgg aacccagaga tccagtacac aaacaactac aacgaccccc agtttgtgga 2160 

ctttgccccg gacagcaccg gggaatacag aaccaccaga cctatcggaa cccgatacct 2220 

tacccgaccc ctttaaccca ttcatgtcgc ataccctcaa taaa 2264 



SEQ ID NO: 32 

agcgcaaacg gctcgtcgcg cagtttctgg cagaatcctc gcagcgctcg caggaggcgg 60 

cttcgcagcg tgagttctcg gctgacccgg tcatcaaaag caagacttcc cagaaataca 120 

tggcgctcgt caactggctc gtggagcacg gcatcacttc cgagaagcag tggatccagg 180 

aaaatcagga gagctacctc tccttcaact ccaccggcaa ctctcggagc cagatcaagg 240 

ccgcgctcga caacgcgacc aaaattatga gtctgacaaa aagcgcggtg gactacctcg 300 

tggggagctc cgttcccgag gacatttcaa aaaacagaat ctggcaaatt tttgagatga 360 

atggctacga cccggcctac gcgggatcca tcctctacgg ctggtgtcag cgctccttca 420 

acaagaggaa caccgtctgg ctctacggac ccgccacgac cggcaagacc aacatcgcgg 480 

aggccatcgc ccacactgtg cccttttacg gctgcgtgaa ctggaccaat gaaaactttc 540 

cctttaatga ctgtgtggac aaaatgctca tttggtggga ggagggaaag atgaccaaca 600 

aggtggttga atccgccaag gccatcctgg ggggctcaaa ggtgcgggtc gatcagaaat 660 

gtaaatcctc tgttcaaatt gattctaccc ctgtcattgt aacttccaat acaaacatgt 720 

gtgtggtggt ggatgggaat tccacgacct ttgaacacca gcagccgctg gaggaccgca 780 

tgttcaaatt tgaactgact aagcggctcc cgccagattt tggcaagatt actaagcagg 84 0 

aagtcaagga cttttttgct tgggcaaagg tcaatcaggt gccggtgact cacgagttta 900 

aagttcccag ggaattggcg ggaactaaag gggcggagaa atctctaaaa cgcccactgg 960 

gtgacgtcac caatactagc tataaaagtc tggagaagcg ggccaggctc tcatttgttc 102 0 

ccgagacgcc tcgcagttca gacgtgactg ttgatcccgc tcctctgcga ccgctcaatt 1080 

ggaattcaag gtatgattgc aaatgtgact atcatgctca atttgacaac atttctaaca 1140 

aatgtgatga atgtgaatat ttgaatcggg gcaaaaatgg atgtatctgt cacaatgtaa 1200 

ctcactgtca aatttgtcat gggattcccc cctgggaaaa ggaaaacttg tcagattttg 1260 

gggattttga cgatgccaat aaagaacagt aa 1292 



SEQ ID NO: 33 

attctttgct ctggactgct agaggaccct cgctgccatg gctaccttct atgaagtcat 6 0 

tgttcgcgtc ccatttgacg tggaggaaca tctgcctgga atttctgaca gctttgtgga 12 0 

ctgggtaact ggtcaaattt gggagctgcc tccagagtca gatttaaatt tgactctggt 180 

tgaacagcct cagttgacgg tggctgatag aattcgccgc gcgctcctgt acgagtggaa 24 0 

caaattttcc aagcaggagt ccaaattctt tgtgcagttt gaaaagggat ctgaatattt 300 

tcatctgcac acgcttgtgg a^acc^aa- v«uvt?crtcc art gg tec teg gccgctacgt 36 0 

gagtcagatt cgcgcccagc tggtgaaagt ggtcttccag ggaattgaac cccagatcaa 42 0 

cgactgggtc gccatcacca aggtaaagaa gggeggagee aataaggtgg tggattctgg 48 0 

gtatattccc gcctacctgc— Jtgrcgaaggt ccaaccggag cttcagtggg cgtggacaaa 54 0 

cctggacgag tataaattgg ccgccctgaa tctggaggag cgcaaacggc tcgtcgcgca 60 0 

gtttctggca gaatcctcgc agcgctcgca ggaggegget tegcagegtg agttctcggc 66 0 

tgacccggtc atcaaaagca agacttccca gaaatacatg gcgctcgtca actggctcgt 72 0 

ggagcaegge atcacttccg agaagcagtg gatccaggaa aatcaggaga gctacctctc 780 

cttcaactcc accggcaact ctcggagcca gatcaaggee gcgctcgaca acgcgaccaa 84 0 
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aattatgagt ctgacaaaaa gcgcggtgga 
catttcaaaa aacagaatct ggcaaatttt 
gggatccatc ctctacggct ggtgtcagcg 
ctacggaccc gccacgaccg gcaagaccaa 
cttttacggc tgcgtgaact ggaccaatga 
aatgctcatt tggtgggagg agggaaagat 
catcctgggg ggctcaaagg tgcgggtcga 
ttctacccct gtcattgtaa cttccaatac 
cacgaccttt gaacaccagc agccgctgga 
gcggctcccg ccagattttg gcaagattac 
ggcaaaggtc aatcaggtgc cggtgactca 
aactaaaggg gcggagaaat ctctaaaacg 
taaaagtctg gagaagcggg ccaggctctc 
cgtgactgtt gatcccgctc ctctgcgacc 
atgtgactat catgctcaat ttgacaacat 
gaatcggggc aaaaatggat gtatctgtca 
gattcccccc tgggaaaagg aaaacttgtc 
agaacagtaa 



SEQ ID NO: 34 



Met 


a 


Leu 


Val 


Asn 


Trn 


Leu 


Val 


Glu 


His 


Glv 


He 


Thr 


Ser 


Glu 


Lys 


1 








c 










10 










15 




bin 


Trp 


He 


Gin 


Glu 


Asn 


Gin 


Glu 


Ser 


Tvr 
j 


Leu 


Ser 


Phe 


Asn 


Ser 


Thr 






20 




















30 






uiy 


Asn 


Ser 


Arg 


Ser 




Tip 

lie 


Lys 


Ala 


Ala 


Leu 


Asp 


Asn 


Ala 


Thr 


Lys 




35 








a. n 










4 5 








lie 


Met 
50 


Ser 


Leu 


XIIL 


Lys 




Ala 


Val 


Asp 


Tvr 


Leu 
60 


val 


Gly 


Ser 


Ser 


Val 


Pro 


u 1 U 


Asp 


Tl p 

lie 




Lys 


Asn 


rt.x, y 


He 




Gin 


He 


Phe 


Glu 


Met 


65 








/ U 










75 










80 


Asn 


Giy 


Tyr 


Asp 


Pro 


A "1 -a 
rt. J_ d 


Tyr 


rt.X d 


Gly 


Ser 


lie 


Leu 


Tvr 


Glv 




Cys 




Q ^ 

O 3 










90 










95 




Gin 




Ser 


Phe 


Asn 


Lys 


Ara 


Asn 


Thr 


Val 


Trp 


Leu 


Tyr 


Gly 


Pro 


Ala 






100 










105 










110 






Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 


Glu 


Ala 


He 


Ala 


His 


Thr 


Val 


Pro 






115 








120 










125 








Phe 


Tyr 


Gly 


Cys 


Val 


Asn 


Trp 


Thr 


Asn 


Glu 


Asn 


Phe 


Pro 


Phe 


Asn 


Asp 




130 








135 










140 










Cys 


Val 


Asp 


Lys 


Met 


Leu 


He 


Trp 


Trp 


Glu 


Glu 


Gly 


Lys 


Met 


Thr 


Asn 


145 






150 










155 








Val 


160 


Lys 


Val 


Val 


Glu 


Ser 


Ala 


Lys 


Ala 


lie 


Leu 


Gly 


Gly 


Ser 


Lys 


Arg 








165 










170 










175 




Val 


Asp 


Gin 


Lys 


Cys 


Lys 


Ser 


Ser 


Val 


Gin 


He 


Asp 


Ser 


Thr 


Pro 


val 






180 










185 










190 






lie 


Val 


Thr 
195 


Ser 


Asn 


Thr 


Asn 


Met 
200 


Cys 


Val 


Val 


Val 


Asp 
205 


Gly 


Asn 


Ser 


Thr 


Thr 
210 


Phe 


Glu 


His 


Gin 


Gin 
215 


Pro 


Leu 


Glu 


Asp 


Arg 
220 


Met 


Phe 


Lys 


Phe 


Glu 


Leu 


Thr 


Lys 


Arg 


Leu 


Pro 


Pro 


Asp 


Phe 


Gly 


Lys 


He 


Thr 


Lys 


Gin 


225 






230 










235 










240 


Glu 


Val 


Lys 


Asp 


Phe 


Phe 


Ala 


Trp 


Ala 


Lys 


Val 


Asn 


Gin 


Val 


Pro 


Val 






245 










250 










255 




Thr 


His 


Glu 


Phe 


Lys 


Val 


Pro 


Arg 


Glu 


Leu 


Ala 


Gly 


Thr 


Lys 


Gly 


Ala 








260 








265 










270 






Glu 


Lys 


Ser 


Leu 


Lys 


Arg 


Pro 


Leu 


Gly 


Asp 


Val 


Thr 


Asn 


Thr 


Ser 


Tyr 




275 










280 










285 








Lys 


Ser 


Leu 


Glu 


Lys 


Arg 


Ala 


Arg 


Leu 


Ser 


Phe 


Val 


Pro 


Glu 


Thr 


Pro 


290 










295 










300 










Arg 


Ser 


Ser 


Asp 


Val 


Thr 


Val 


Asp 


Pro 


Ala 


Pro 


Leu 


Arg 


Pro 


Leu 


Asn 


305 








310 










315 










320 


Trp 


Asn 


Ser 


Arg 


Leu 


Val 


Gly 


Arg 


Ser 


Trp 















325 330 



102 



ctacctcgtg gggagctccg 
tgagatgaat ggctacgacc 
ctccttcaac aagaggaaca 
catcgeggag gccatcgccc 
aaactttccc tttaatgact 
gaccaacaag gtggttgaat 
tcagaaatgt aaatcctctg 
aaacatgtgt gtggtggtgg 
ggaccgcatg ttcaaatttg 
taagcaggaa gtcaaggact 
cgagtttaaa gttcccaggg 
cccactgggt gacgtcacca 
atttgttccc gagacgcctc 
gctcaattgg aattcaaggt 
ttctaacaaa tgtgatgaat 
caatgtaact cactgtcaaa 
agattttggg gattttgacg 



ttcccgagga 900 

cggcctacgc 960 

ccgtctggct 1020 

acactgtgcc 1080 

gtgtggacaa 1140 

ccgccaaggc 1200 

ttcaaattga 1260 

atgggaattc 1320 

aactgactaa 1380 

tttttgcttg 1440 

aattggcggg 1500 

atactagcta 1560 

gcagttcaga 1620 

atgattgcaa 1680 

gtgaatattt 1740 

tttgtcatgg 1800 

atgccaataa 1860 
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SEQ ID NO: 35 



aggagcgcaa 
cggcttcgca 
acatggcgct 
aggaaaatca 
aggccgcgct 
tcgtggggag 
tgaatggcta 
tcaacaagag 
cggaggccat 
ttccctttaa 
acaaggtggt 
aatgtaaatc 
tgtgtgtggt 
gcatgttcaa 
aggaagtcaa 
ttaaagttcc 
tgggtgacgt 
ttcccgagac 
attggaattc 



acggctcgtc 
gcgtgagttc 
cgtcaactgg 
ggagagctac 
cgacaacgcg 
ctccgttccc 
cgacccggcc 
gaacaccgtc 
cgcccacact 
tgactgtgtg 
tgaatccgcc 
ctctgttcaa 
ggtggatggg 
atttgaactg 
ggactttttt 
cagggaattg 
caccaatact 
gcctcgcagt 
aagattggtt 



gcgcagtttc 
tcggctgacc 
ctcgtggagc 
ctctccttca 
accaaaatta 
gaggacattt 
tacgcgggat 
tggctctacg 
gtgccctttt 
gacaaaatgc 
aaggccatcc 
attgattcta 
aattccacga 
actaagcggc 
gcttgggcaa 
gcgggaacta 
agctataaaa 
tcagacgtga 
ggaagaagtt 



tggcagaatc 
cggtcatcaa 
acggcatcac 
actccaccgg 
tgagtctgac 
caaaaaacag 
ccatcctcta 
gacccgccac 
acggctgcgt 
tcatttggtg 
tggggggctc 
cccctgtcat 
cctttgaaca 
tcccgccaga 
aggtcaatca 
aaggggcgga 
gtctggagaa 
ctgttgatcc 
ggtga 



ctcgcagcgc 
aagcaagact 
ttccgagaag 
caactctcgg 
aaaaagcgcg 
aatctggcaa 
cggctggtgt 
gaccggcaag 
gaactggacc 
ggaggaggga 
aaaggtgcgg 
tgtaacttcc 
ccagcagccg 
ttttggcaag 
ggtgccggtg 
gaaatctcta 
gcgggccagg 
cgctcctctg 



tcgcaggagg 
tcccagaaat 
cagtggatcc 
agccagatca 
gtggactacc 
atttttgaga 
cagcgctcct 
accaacatcg 
aatgaaaact 
aagatgacca 
gtcgatcaga 
aatacaaaca 
ctggaggacc 
attactaagc 
actcacgagt 
aaacgcccac 
ctctcatttg 
cgaccgctca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1115 



SEQ ID NO: 36 



Met 


Ala 


Thr 


Phe 


Tyr 


Glu 


Val 


lie 


1 

Glu 


His 


Leu 


Pro 


5 

Gly 


He 


Ser 


Asp 








20 










Gin 


He 


Trp 


Glu 


Leu 


Pro 


Pro 


Glu 






35 










40 


Glu 


Gin 


Pro 


Gin 


Leu 


Thr 


Val 


Ala 




50 










55 




Tyr 


Glu 


Trp 


Asn 


Lys 


Phe 


Ser 


Lys 


65 










70 






Phe 


Glu 


Lys 


Gly 


Ser 


Glu 


Tyr 


Phe 










85 








Ser 


Gly 


He 


Ser 


Ser 


Met 


Val 


Leu 








100 










Ala 


Gin 


Leu 


val 


Lys 


Val 


Val 


Phe 






115 










120 


Asp 


Trp 


Val 


Ala 


He 


Thr 


Lys 


Val 




130 










135 




Val 


Asp 


Ser 


Gly 


Tyr 


He 


Pro 


Ala 


145 










150 






Glu 


Leu 


Gin 


Trp 


Ala 


Trp 


Thr 


Asn 










165 








Leu 


Asn 


Leu 


Glu 


Glu 


Arg 


Lys 


Arg 








180 










Ser 


Ser 


Gin 


Arg 


Ser 


Gin 


Glu 


Ala 






195 










200 


Asp 


Pro 


Val 


He 


Lys 


Ser 


Lys 


Thr 




210 














Asn 


Trp 


Leu 


Val 


Glu 


His 


Gly 


He 


225 










230 






Glu 


Asn 


Gin 


Glu 


Ser 


Tyr 


Leu 


Ser 










245 


Ala 






Ser 


Gin 


lie 


Lys 


Ala 


Leu 


Asp 








260 










Thr 


Lys 


Ser 


Ala 


Val 


Asp 


Tyr 


Leu 




275 










280 
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val 


Arq 


Val 


Pro 


Phe 


Asp 


Val 


Glu 




10 










15 




Ser 


Phe 


Val 


Asp 


Trp 


Val 


Thr 


Gly 


25 










30 






Ser 


Asp 


Leu 


Asn 


Leu 


Thr 


Leu 


Val 










45 








Asp 


Arg 


He 


Arg 


Arg 


Val 


Phe 


Leu 








60 










Gin 


Glu 


Ser 


Lys 


Phe 


Phe 


Val 


Gin 






75 










80 


His 


Leu 


His 


Thr 


Leu 


Val 


Glu 


Thr 




90 










95 




Gly 


Arg 


Tyr 


Val 


Ser 


Gin 


He 


Arg 


105 










110 






Gin 


Gly 


He 


Glu 


Pro 


Gin 


He 


Asn 








125 








Lys 


Lys 


Gly 


Gly 


Ala 


Asn 


Lys 


Val 








140 










Tyr 


Leu 


Leu 


Pro 


Lys 


Val 


Gin 


Pro 




155 










160 


Leu 


Asp 


Glu 


Tyr 


Lys 


Leu 


Ala 


Ala 




170 










175 




Leu 


Val 


Ala 


Gin 


Phe 


Leu 


Ala 


Glu 


185 










190 






Ala 


Ser 


Gin 


Arg 


Glu 


Phe 


Ser 


Ala 










205 








Ser 


Gin 


Lys 


Tyr 


Met 


Ala 


Leu 


Val 








220 










Thr 


Ser 


Glu 


Lys 


Gin 


Trp 


lie 


Gin 






235 










240 


Phe 


Asn 


Ser 


Thr 


Gly 


Asn 


Ser 


Arg 




250 










255 




Asn 


Ala 


Thr 


Lys 


He 


Met 


Ser 


Leu 


265 










270 






Val 


Gly 


Ser 


Ser 


val 


Pro 


Glu 


Asp 



285 
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lie 


Sex* 


Lys 


Asn 


Arc* 


He 


Tro 


Gin 


He 


Phe 


Glu 


Met 


Asn 


Gly 


Tyr 


Asp 




2 90 








2 95 










300 










Pro 


Ala 


Tvr 
2 


Ala 


Gly 


Ser 


He 


Leu 


Tyr 


Gly 


Trp 


Cys 


Gin 


Arg 


Ser 


Phe 


3 05 






310 










315 










320 


Asn 


Lys 


Arq 


Asn 


Thr 


Val 


Trp 


Leu 


Tyr 


Gly 


Pro 


Ala 


Thr 


Thr 


Gly 


Lys 






325 










330 










335 




Thr 


Asn 


He 


Ala 


Glu 


Ala 


He 


Ala 


His 


Thr 


Val 


Pro 


Phe 


Tyr 


Gly 


Cys 








34 0 










345 










350 






Val 


Asn 


Trn 


Thr 


Asn 


Glu 


Asn 


Phe 


Pro 


Phe 


Asn 


Asp 


Cys 


Val 


Asp 


Lys 






-J -> ZJ 










3 60 










365 








Met 


Leu. 


lie 


Trti 


Tro 


Glu 


Glu 


Gly 


Lys 


Met 


Thr 


Asn 


Lys 


Val 


Val 


Glu 




3 7 0 










375 










380 










Ser 


Ala 


Lys 


Ala 


He 


Leu 


Gly 


Gly 


Ser 


Lys 


Val 


Arg 


Val 


Asp 


Gin 


Lys 


3 85 








390 










395 










400 


Cys 


Lvs 


Ser 


Ser 


Val 


Gin 


He 


Asp 


Ser 


Thr 


Pro 


Val 


He 


Val 


Thr 


Ser 






405 










410 










415 




Asn 


Thr 


Asn 


Met 


Cys 


Val 


Val 


Val 


Asp 


Gly 


Asn 


Ser 


Thr 


Thr 


Phe 


Glu 








420 










425 










430 






His 


Gin 


Gin 


Pro 


Leu 


Glu 


Asp 


Arg 


Met 


Phe 


Lys 


Phe 


Glu 


Leu 


Thr 


Lys 






4 3 5 










440 










445 








Arg 


Leu 


Pro 


Pro 


Asp 


Phe 


Gly 
i 


Lys 

2 


He 


Thr 


Lys 


Gin 


Glu 


Val 


Lys 


Asp 


4 5 0 










455 










460 










Phe 


Phe 


Ala 


Tro 


Ala 


Lvs 


Val 


Asn 


Gin 


Val 


Pro 


val 


Thr 


His 


Glu 


Phe 


4 65 








470 










475 










480 


Lys 


Val 


Pro 


Arg 


Glu 


Leu 


Ala 


Gly 


Thr 


Lys 


Gly 


Ala 


Glu 


Lys 


Ser 


Leu 








485 










490 










495 




Lys 


Arg 


Pro 


Leu 


Gly 


Asp 


Val 


Thr 


Asn 


Thr 


Ser 


Tyr 


Lys 


Ser 


Leu 


Glu 




500 










505 










510 






Lys 


Arg 


Ala 


Arg 


Leu 


Ser 


Phe 


Val 


Pro 


Glu 


Thr 


Pro 


Arg 


Ser 


Ser 


Asp 


515 










520 










525 








Val 


Thr 


Val 


Asp 


Pro 


Ala 


Pro 


Leu 


Arg 


Pro 


Leu 


Asn 


Trp 


Asn 


Ser 


Arg 




530 








535 










540 










Leu 


Val 


Gly 


Arg 


Ser 


Trp 






















545 










550 























SKQ ID NO: 37 

attctttgct ctggactgct 

tgttcgcgtc ccatttgacg 

ctgggtaact ggtcaaattt 

tgaacagcct cagttgacgg 

caaattttcc aagcaggagt 

tcatctgcac acgcttgtgg 

gagtcagatt cgcgcccagc 

cgactgggtc gccatcacca 

gtatattccc gcctacctgc 

cctggacgag tataaattgg 

gtttctggca gaatcctcgc 

tgacccggtc atcaaaagca 

ggagcacggc atcacttccg 

cttcaactcc accggcaact 

aattatgagt ctgacaaaaa 

catttcaaaa aacagaatct 

gggatccatc ctctacggct 

ctacggaccc gccacgaccg 

cttttacggc tgcgtgaact 

aatgctcatt tggtgggagg 

catcctgggg ggctcaaagg 

ttctacccct gtcattgtaa 

cacgaccttt gaacaccagc 

gcggctcccg ccagattttg 

ggcaaaggtc aatcaggtgc 

aactaaaggg gcggagaaat 



agaggaccct cgctgccatg gctaccttct atgaagtcat 60 

tggaggaaca tctgcctgga atttctgaca gctttgtgga 120 

gggagctgcc tccagagtca gatttaaatt tgactctggt 180 

tggctgatag aattcgccgc gtgttcctgt acgagtggaa 240 

ccaaattctt tgtgcagttt gaaaagggat ctgaatattt 300 

agacctccgg catctcttcc atggtcctcg gccgctacgt 360 

tggtgaaagt ggtcttccag ggaattgaac cccagatcaa 420 

aggtaaagaa gggcggagcc aataaggtgg tggattctgg 4 80 

tgccgaaggt ccaaccggag cttcagtggg cgtggacaaa 540 

ccgccctgaa tctggaggag cgcaaacggc tcgtcgcgca 600 

agcgctcgca ggaggcggct tcgcagcgtg agttctcggc 660 

agacttccca gaaatacatg gcgctcgtca actggctcgt 72 0 

agaagcagtg gatccaggaa aatcaggaga gctacctctc 780 

ctcggagcca gatcaaggcc gcgctcgaca acgcgaccaa 84 0 

gcgcggtgga ctacctcgtg gggagctccg ttcccgagga 90 0 

ggcaaatttt tgagatgaat ggctacgacc cggcctacgc 96 0 

ggtgtcagcg ctccttcaac aagaggaaca ccgtctggct 102 0 

gcaagaccaa catcgcggag gccatcgccc acactgtgcc 1080 

ggaccaatga aaactttccc tttaatgact gtgtggacaa 114 0 

agggaaagat gaccaacaag gtggttgaat ccgccaaggc 12 0 0 

tgcgggtcga tcagaaatgt aaatcctctg ttcaaattga 1260 

cttccaatac aaacatgtgt gtggtggtgg atgggaattc 132 0 

agccgctgga ggaccgcatg ttcaaatttg aactgactaa 13 8 0 

gcaagattac taagcaggaa gtcaaggact tttttgcttg 144 0 

cggtgactca cgagtttaaa gttcccaggg aattggcggg 1500 

ctctaaaacg cccactgggt gacgtcacca atactagcta 1560 



104 
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taaaagtctg gagaagcggg ccaggctctc atttgttccc gagacgcctc gcagttcaga 
cgtgactgtt gatcccgctc ctctgcgacc gctcaattgg aattcaagat tggttggaag 
aagttggtga 



1620 
1680 
1690 



SEQ ID NO: 38 

ccatcaccaa ggtaaagaag ggcggagcca ataaggtggt ggattctggg tatattcccg 
cctacctgct gccgaaggtc caaccggagc ttcagtgggc gtggacaaac ctggacgagt 
ataaattggc cgccctgaat ctgga 



60 
120 
145 



SEQ ID NO : 3 9 

taagcaggaa gtcaaggact tttttgcttg ggcaaaggtc aatcaggtgc cggtgactca 

cgagtttaaa gttcccaggg aattggcggg aactaaaggg gcggagaaat ctctaaaacg 

cccactgggt gacgtcacca atactagcta taaaagtctg gagaagcggg ccag 



60 
120 
174 



SEQ ID NO: 40 

cactctcaag caagggggtt ttgtaagcag tgatgtcata atgatgtaat gcttattgtc 
acgcgatagt taatgattaa cagtcatgtg atgtgtttta tccaatagga agaaagcgcg 
cgtatgagtt ctcgcgagac ttccggggta taaaagaccg agtgaacgag cccgccgcca 
ttctttg 



60 
120 
180 
187 



SEQ ID NO: 41 

aaacctcctt gcttgagagt gtggcactct cccccctgtc gcgttcgctc gctcgctggc 

tcgtttgggg gggtggcagc tcaaagagct gccagacgac ggccctctgg ccgtcgcccc 

cccaaacgag ccagcgagcg agcgaacgcg acagggggga gagtgcca 



60 
120 
168 



SEQ ID NO: 42 

aaacctcctt gcttgagagt gtggcactct cccccctgtc gcgttcgctc gctcgctggc 

tcgtttgggg gggcgacggc cagagggccg tcgtctgccg gctctttgag ctgccacccc 

cccaaacgag ccagcgagcg agcgaacgcg acagggggga gagtgcca 



60 
120 
168 



SEQ ID NO: 43 

cggtgtga 

SEQ ID NO: 44 

cggttgag 



SEQ ID NO: 45 

caaaacctcc ttgcttgaga g 



21 



SEQ ID NO:46 

ttggccactc 
cgacgcccgg 
gccaactcca 
ggttagggag 
gtggtcacgc 
ggtttgaacg 



233J34_17.DOC 



cctctctgcg 
gctttgcccg 
tcactagggg 
gtcctgtatt 
tgggtattta 
cgcagccgcc 



cgctcgctcg 
ggcggcctca 
ttcctggagg 
agaggtcacg 
agcccgagtg 
atgccggggt 



ctcactgagg 
gtgagcgagc 
ggtggagtcg 
tgagtgtttt 
agcacgcagg 
tttacgagat 

105 



ccgggcgacc 
gagcgcgcag 
tgacgtgaat 
gcgacatttt 
gtctccattt 
tgtgattaag 



aaaggtcgcc 
agagggagtg 
tacgtcatag 
gcgacaccat 
tgaagcggga 
gtccccagcg 



60 
120 
180 
240 
300 
360 
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accttgacgg gcatctgccc ggcatttctg acagctttgt gaactgggtg gccgagaagg 420 

aatgggagtt gccgccagat tctgacatgg atctgaatct gattgagcag gcacccctga 480 

ccgtggccga gaagctgcag cgcgactttc tgacggaatg gcgccgtgtg agtaaggccc 540 

cggaggccct tttctttgtg caatttgaga agggagagag ctacttccac atgcacgtgc 600 

tcgtggaaac caccggggtg aaatccatgg ttttgggacg tttcctgagt cagattcgcg 660 

aaaaactgat tcagagaatt taccgcggga tcgagccgac tttgccaaac tggttcgcgg 720 

tcacaaagac cagaaatggc gccggaggcg ggaacaaggt ggtggatgag tgctacatcc 780 

ccaattactt gctccccaaa acccagcctg agctccagtg ggcgtggact aatatggaac 840 

agtatttaag cgcctgtttg aatctcacgg agcgtaaacg gttggtggcg cagcatctga 900 

cgcacgtgtc gcagacgcag gagcagaaca aagagaatca gaatcccaat tctgatgcgc 960 

cggtgatcag atcaaaaact tcagccaggt acatggagct ggtcgggtgg ctcgtggaca 1020 

aggggattac ctcggagaag cagtggatcc aggaggacca ggcctcatac atctccttca 1080 

atgcggcctc caactcgcgg tcccaaatca aggctgcctt ggacaatgcg ggaaagatta 1140 

tgagcctgac taaaaccgcc cccgactacc tggtgggcca gcagcccgtg gaggacattt 1200 

ccagcaatcg gatttataaa attttggaac taaacgggta cgatccccaa tatgcggctt 1260 

ccgtctttct gggatgggcc acgaaaaagt tcggcaagag gaacaccatc tggctgtttg 1320 

ggcctgcaac taccgggaag accaacatcg cggaggccat agcccacact gtgcccttct 1380 

acgggtgcgt aaactggacc aatgagaact ttcccttcaa cgactgtgtc gacaagatgg 1440 

tgatctggtg ggaggagggg aagatgaccg ccaaggtcgt ggagtcggcc aaagccattc 1500 

tcggaggaag caaggtzgcgc gtggaccaga aatgcaagtc ctcggcccag atagacccga 1560 

ctcccgtgat cgtcacctcc aacaccaaca tgtgcgccgt gattgacggg aactcaacga 1620 

ccttcgaaca ccagcagccg ttgcaagacc ggatgttcaa atttgaactc acccgccgtc 1680 

tggatcatga ctttgggaag gtcaccaagc aggaagtcaa agactttttc cggtgggcaa 1740 

aggatcacgt ggttgaggtg gagcatgaat tctacgtcaa aaagggtgga gccaagaaaa 1800 

gacccgcccc cagtgacgca gatataagtg agcccaaacg ggtgcgcgag tcagttgcgc 1860 

agccatcgac gtcagacgcg gaagcttcga tcaactacgc agacaggtac caaaacaaat 1920 

gttctcgtca cgtgggcatg aatctgatgc tgtttccctg cagacaatgc gagagaatga 1980 

atcagaattc aaatatctgc ttcactcacg gacagaaaga ctgtttagag tgctttcccg 2040 

tgtcagaatc tcaacccgtt tctgtcgtca aaaaggcgta tcagaaactg tgctacattc 2100 

atcatatcat gggaaaggtg ccagacgctt gcactgcctg cgatctggtc aatgtggatt 216 0 

tggatgactg catctttgaa caataaatga tttaaatcag gtatggctgc cgatggttat 222 0 

cttccagatt ggctcgagga cactctctct gaaggaataa gacagtggtg gaagctcaaa 2280 

cctggcccac caccaccaaa gcccgcagag cggcataagg acgacagcag gggtcttgtg 2 34 0 

cttcctgggt acaagtacct cggacccttc aacggactcg acaagggaga gccggtcaac 2400 

gaggcagacg ccgcggccct cgagcacgta caaagcctac gaccggcagc tcgacagcgg 2460 

agacaacccg tacctcaagt acaaccacgc cgacgcggag tttcaggagc gccttaaaga 2 52 0 

agatacgtct tttgggggca acctcggacg agcagtcttc caggcgaaaa agagggttct 2 580 

tgaacctctg ggcctggttg aggaacctgt taagacggct ccgggaaaaa agaggccggt 2 64 0 

agagcactct cctgtggagc cagactcctc ctcgggaacc ggaaaggcgg gccagcagcc 2700 

tgcaagaaaa agattgaatt ttggtcagac tggagacgca gactcagtac ctgaccccca 2760 

gcctctcgga cagccaccag cagccccctc tggtctggga actaatacga tggctacagg 2 82 0 

cagtggcgca ccaatggcag acaataacga gggcgccgac ggagtgggta attcctccgg 2 880 

aaattggcat tgcgattcca catggatggg cgacagagtc atcaccacca gcacccgaac 2940 

ctgggccctg cccacctaca acaaccacct ctacaaacaa atttccagcc aatcaggagc 3000 

ctcgaacgac aatcactact ttggctacag caccccttgg gggtattttg acttcaacag 3060 

attccactgc cacttttcac cacgtgactg gcaaagactc atcaacaaca actggggatt 3120 

ccgacccaag agactcaact tcaagctctt taacattcaa gtcaaagagg tcacgcagaa 3180 

tgacggtacg acgacgattg ccaataacct taccagcacg gttcaggtgt ttactgactc 3240 

ggagtaccag ctcccgtacg tcctcggctc ggcgcatcaa ggatgcctcc cgccgttccc 3300 

agcagacgtc ttcatggtgc cacagtatgg atacctcacc ctgaacaacg ggagtcaggc 3360 

agtaggacgc tcttcatttt actgcctgga gtactttcct tctcagatgc tgcgtaccgg 3420 

aaacaacttt accttcagct acacttttga ggacgttcct ttccacagca gctacgctca 3480 

cagccagagt ctggaccgtc tcatgaatcc tctcatcgac cagtacctgt attacttgag 3540 

cagaacaaac actccaagtg gaaccaccac gcagtcaagg cttcagtttt ctcaggccgg 3 600 

agcgagtgac attcgggacc agtctaggaa ctggcttcct ggaccctgtt accgccagca 3660 

^fgtg gr gartrca aagacatctg cggataacaa ca-acagtgaa tstctcgtgga ctggagctac 3720 

caagtaccac ctcaatggca gagactctct ggtgaatccg gccatggcaa gccacaagga 3780 

cgatgaagaa aagttttttc ctcagagcgg ggttctcatc tttgggaagc aaggctcaga 3840 

gaaaacaaat gtgaacattg aaaaggtcat gattacagac gaagaggaaa tcggaacaac 3 900 

caatcccgtg gctacggagc agtatggttc tgtatctacc aacctccaga gaggcaacag 3 960 

acaagcagct accgcagatg tcaacacaca aggcgttctt ccaggcatgg tctggcagga 4 020 

cagagatgtg taccttcagg ggcccatctg ggcaaagatt ccacacacgg acggacattt 4080 

tcacccctct cccctcatgg gtggattcgg acttaaacac cctcctccac agattctcat 4140 

caagaacacc ccggtacctg cgaatccttc gaccaccttc agtgcggcaa agtttgcttc 4200 

cttcatcaca cagtactcca cgggacacgg tcagcgtgga gatcgagtgg gagctgcaga 4260 
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aggaaaacag caaacgctgg aatcccgaaa ttcagtacac ttccaactac aacaagtctg 4320 

ttaatcgtgg acttaccgtg gatactaatg gcgtgtattc agagcctcgc cccattggca 4380 

ccagatacct gactcgtaat ctgtaattgc ttgttaatca ataaaccgtt taattcgttt 4440 

cagttgaact ttggtctctg cgtatttctt tcttatctag tttccatggc tacgtagata 4500 

agtagcatgg cgggttaatc attaactaca aggaacccct agtgatggag ttggccactc 4560 

cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg 4620 

gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg gccaa 4675 



SEQ XD NO: 47 

BAAV complete genome 

gtggcactcccccccctgtcgcgttcgctcgttcgctggctcgattgggggggtggcagctcaaagagctgcca 

gacgacggccctctgggccgtcgcccccccaatcgagccagcgaacgagcgaacgcgacaggggggggagtgcc 

acactctctagcaagggggttttgtaggtggtgatgtcattgttgatgtcattatagttgtcacgcgatagtta 

atgattaacagtcatgtgatgtgtgttatccaataggatgaaagcgcgcgaatgagatctcgcgagacttccgg 

ggtataaaaggggtgagtgaacgagcccgccgccattctctgctctggactgctagaggaccctcgctgccatg 

gctaccttctatgaagtcattgttcgcgttccatttgatgtggaagagcacctgcctggaatttctgacaactt 

tgtagactgggtaactggtcaaatttgggagctgcctcccgagtcagatttgaatttgactctgattgagcagc 

ctcagctgacggtggctgacagaattcgccgcgtgttcctgtacgagtggaacaaattttccaagcaggagagc 

aaattctttgtgcagtttgaaaagggatctgaatattttcatctgcacacgctcgtggagacctccggcatctc 

ttctatggtccttggccgctacgtgagtcagattcgcgcccagctggtgaaggtggtgttccagaacattgagc 

cgcggattaacgactgggtcgccatcaccaaggtaaagaagggcggagccaataaggtggtggattctgggtat 

attcccgcctacctgctgccgaaggtccaaccagagcttcagtgggcgtggactaacctcgaagagtataaatt 

ggccgccctcaatctggaggagcgcaaacggctcgtcgctcagtttcagcttgagtcctcgcagcgctcgcaag 

aggcatcttcccagagggacgtttcggctgacccggtcatcaagagcaagacttcccagaaatacatggcgctg 

gtaagctggctggtggaacatggcatcacttccgagaagcagtggattcaggagaatcaggagagctacctgtc 

cttcaactccacgggaaactctcggagccagattaaagccgcgcttgacaacgcgtcaaaaattatgagtctga 

ccaaatctgcctcagactatctcgtgggacagactgttccagaggacatttctgaaaacagaatctggcagatt 

tttgatctcaacggctacgacccggcatacgcgggctctgttctctacggctggtgcactcgcgcctttggaaa 

gaggaacaccgtctggctgtatggacccgcgaccaccggaaagaccaacatcgcggaagccatctctcacaccg 

tgcccttttatggctgtgtgaactggactaatgagaactttccctttaatgactgtgtggaaaaaatgttgatc 

tggtgggaggagggaaagatgaccagcaaggtggtggaacccgccaaggccatcttgggggggtctagagtacg 

agtggatcaaaaatgtaaatcctctgtacaagtagactctaccccggtgattatcacctccaatactaacatgt 

gtgtggtggtggatgggaactccacgacctttgaacaccagcagccgctggaagaccgcatgttcagatttgaa 

ctcatgcggcggctcccgccagattttggcaagattaccaagcaggaagtcaaagacttttttgcttgggcaaa 

ggtcaaccaggtgccggtgactcacgagtttatggttcccaagaaagtggcgggaactgagagggcggagactt 

ctagaaaacgcccactggatgacgtcaccaataccaactataaaagtccggagaagcgggcccggctctcagtt 

gttcctgagacgcctcgcagttcagacgtgcctgtagagcccgctcctctgcgacctctcaactggtcttccag 

gtatgaatgcagatgtgactatcatgctaaatttgactctgtaacgggggaatgtgacgagtgtgaatatttga 

atcggggcaaaaatggctgtatctttcataatgctacacattgtcaaatttgtcacgctgttcctccatgggaa 

aaggaaaatgtgtcagattttaatgattttgatgactgtaataaagagcagtaaataaagtgagtagtcatgtc 

ttttgttgaccaccctccagattggttggaatcgatcggcgacggctttcgtgaatttctcggccttgaggcgg 

gtcccccgaaacccaaggccaatcaacagaagcaagataacgctcgaggtcttgtgcttcctgggtacaagtat 

cttggtcctgggaacggccttgataagggcgatcctgtcaattttgctgacgaggttgcccgagagcacgacct: 

ctcctaccagaaacagcttgaggcgggcgataacccttacctcaagtacaaccacgcggacgcagagtttcagg 

agaaactcgcttctgacacttcttttgggggaaaccttgggaaggctgttttccaggctaaaaagaggattctc 

gaacctcttggcctggttgagacgccggataaaacggcgcctgcggcaaaaaagaggcctctagagcagagtcc 

tcaagagccagactcctcgagcggagttggcaagaaaggcaaacagcctgccagaaagagactcaactttgacg 

acgaacctggagccggagacgggcctcccccagaaggaccatcttccggagctatgtctactgagactgaaatg 

cgtgcagcagctggcggaaatggtggcgatgcgggacaaggtgccgagggagtgggtaatgcctccggtgattg 

gcattgcgattccacttggtcagagagccacgtcaccaccacctcaacccgcacctgggtcctgccgacctaca 

acaaccacctgtacctgcggctcggctcgagcaacgccagcgacaccttcaacggattctccaccccctgggga 

tactttgactttaaccgcttccactgccacttctcgccaagagactggcaaaggctcatcaacaaccactgggg 

actgcgccccaaaagcatgcaagtccgcatcttcaacatccaagttaaggaggtcacgacgtctaacggggaga 

cgaccgtatccaacaacctca^caycQ^L^uayauemgcgg 

gatgcaggtcaggagggcagcttgcctcctttccccaacgacgtgttcatggtgcctcagtacgggtactgcgg 
actggtaaccggaggcagctctcaaaaccagacagacagaaatgccttctactgtctggagtactttcccagcc 
agatgctgagaaccggaaacanctlrtgagatggtrgranaagtttgaaaacgtgcccttccactccatgtacgct 
cacagccagagcctggataggctgatgaacccgctgctggaccagtacctgtgggagctccagtctaccacctc 
tggaggaactctcaaccagggcaattcagccaccaactttgccaagctgaccaaaacaaacttttctggctacc 
gcaaaaactggctcccggggcccatgatgaagcagcagagattctccaagactgccagtcaaaactacaagatt 
ccccagggaagaaacaacagtctgctccattatgagaccagaactaccctcgacggaagatggagcaattttgc 
cccgggaacggccatggcaaccgcagccaacgacgccaccgacttctctcaggcccagctcatctttgcggggc 

107 

233334_I7.DOC 



ATTORNEY DOCKET NO. 14014.0427U1 



ccaacatcaccggcaacaccaccacagatgccaataacctgatgttcacttcagaagatgaacttagggccacc 
aacccccgggacaccgacctgtttggccacctggcaaccaaccagcaaaacgccaccaccgttcctaccgtaga 
cgacgtggacggagtcggcgtgtacccgggaatggtgtggcaggacagagacatttactaccaagggcccattt 
gggccaaaattccacacacggatggacactttcacccgtctcctctcattggcggatttggactgaaaagcccg 
cctccacaaatattcatcaaaaacactcctgtacccgccaatcccgcaacgaccttctctccggccagaatcaa 
cagcttcatcacccagtacagcaccggacaggtggctgtcaaaatagaatgggaaatccagaaggagcggtcca 
agagatggaacccagaggtccagttcacgtccaactacggagcacaggactcgcttctctgggctcccgacaae 
gccggagcctacaaagagcccagggccattggatcccgatacctcaccaaccacctctagcccaattctgttgc 
ataccctcaataaaccgtgtattcgtttcagtaaaatactgcctcttgtggtcattcggcgtacaacagcttac 
aacaacaacaaaacccccttgctagagagtgtggcactcccccccctgtcgcgttcgctcgttcgctggctcga 
ttgggggggtggcagctcaaagagctgccagacgacggccctctgggccgtcgcccccccaatcgagccagcga 
acgagcgaacgcgacaggggggggagtgccac 



SEQ ID NOx48 
BAAV rep7 8 ORF 

atggctaccttctatgaagtcattgttcgcgttccatttgatgtggaagagcacctgcctggaatttctgacaa 
ctttgtagactgggtaactggtcaaatttgggagctgcctcccgagtcagatttgaatttgactctgattgagc 
agcctcagctgacggtggctgacagaattcgccgcgtgttcctgtacgagtggaacaaattttccaagcaggag 
agcaaattctttgtgcagtttgaaaagggatctgaatattttcatctgcacacgctcgtggagacctccggcat 
ctcttctatggtccttggccgctacgtgagtcagattcgcgcccagctggtgaaggtggtgttccagaacattg 
agccgcggattaacgactgggtcgccatcaccaaggtaaagaagggcggagccaataaggtggtggattctggg 
tatattcccgcctacctgctgccgaaggtccaaccagagcttcagtgggcgtggactaacctcgaagagtataa 
attggccgccctcaatctggaggagcgcaaacggctcgtcgctcagtttcagcttgagtcctcgcagcgctcgc 
aagaggcatcttcccagagggacgtttcggctgacccggtcatcaagagcaagacttcccagaaatacatggcg 
ctggtaagctggctggtggaacatggcatcacttccgagaagcagtggattcaggagaatcaggagagctacct 
gtccttcaactccacgggaaactctcggagccagattaaagccgcgcttgacaacgcgtcaaaaattatgagtc 
tgaccaaatctgcctcagactatctcgtgggacagactgttccagaggacatttctgaaaacagaatctggcag 
atttttgatctcaacggctacgacccggcatacgcgggctctgttctctacggctggtgcactcgcgcctttgg 
aaagaggaacaccgtctggctgtatggacccgcgaccaccggaaagaccaacatcgcggaagccatctctcaca 
ccgtgcccttttatggctgtgtgaactggactaatgagaactttccctttaatgactgtgtggaaaaaatgttg 
atctggtgggaggagggaaagatgaccagcaaggtggtggaacccgccaaggccatcttgggggggtctagagt 
acgagtggatcaaaaatgtaaatcctctgtacaagtagactctaccccggtgattatcacctccaatactaaca 
tgtgtgtggtggtggatgggaactccacgacctttgaacaccagcagccgctggaagaccgcatgttcagattt 
gaactcatgcggcggctcccgccagattttggcaagattaccaagcaggaagtcaaagacttttttgcttgggc 
aaaggtcaaccaggtgccggtgactcacgagtttatggttcccaagaaagtggcgggaactgagagggcggaga 
cttctagaaaacgcccactggatgacgtcaccaataccaactataaaagtccggagaagcgggcccggctctca 
gttgttcctgagacgcctcgcagttcagacgtgcctgtagagcccgctcctctgcgacctctcaactggtcttc 
caggtatgaatgcagatgtgactatcatgctaaatttgactctgtaacgggggaatgtgacgagtgtgaatatt 
tgaatcggggcaaaaatggctgtatctttcataatgctacacattgtcaaatttgtcacgctgttcctccatgg 
gaaaaggaaaatgtgtcagattttaatgattttgatgactgtaataaagagcagtaa 



SEQ ID NO: 49 
BAAV Rep7 8 

MATFYEVIVRVPFDVEEHLPGISDNFVDWVTGQIWELPPESDLNLTLIEQPQLTVADRIRRVFIiYEWNKFSKQE 
SKFFVQFEKGSEYFHLHTLVETSGISSMVLGRYVSQIRAQIiVKWFQNIEPRINDWVAITKVKKGGANKVVDSG 
YIPAYLLPKVQPELQWAWTNLEEYKIjAAXiNLEERKRIjVAQFQLESSQRSQEASSQRDVSADPVIKSKTSQKYMA 
LVSWLVEHGITSEKQWIQENQESYIiSFNSTGNSRSQIKAALDNASKIMSLTKSASDYLVGQTVPEDISENRIWQ 
IFDLNGYDPAYAGSVLYGWCTRAFGKRNTWLYGPATTGKTNIAEAISHTVPFYGCVNWTNENFPFNDCVEKML 
IWWEEGKMTSKWEPAKAILGGSRVRVDQKCKSSVQVDSTPVI ITSNTNMCVWDGNSTTFEHQQPIjEDRMFRF 
ELMRRLPPDFGKITKQEVKDFFAWAKWQVPVTHEFMVPKKVAGTERAETSRKRPLDDVTNTNYKSPEKRARLS 
WPETPRSSDVPVEPAPLRPL^SSRYECRCDYHAKFDSVTGECDECEYLNRGKNGCIFHNATHCQICHAVPPW 

EKEWVSDFND FDDCNKEQ * 



SEQ ID NO: 50 
BAAV rep5 2 ORF 

atggcgctggtaagctggctggtggaacatggcatcacttccgagaagcagtggattcaggagaatcaggagag 
ctacctgtccttcaactccacgggaaactctcggagccagattaaagccgcgcttgacaacgcgtcaaaaatta 
tgagtctgaccaaatctgcctcagactatctcgtgggacagactgttccagaggacatttctgaaaacagaatc 
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tggcagatttttgatctcaacggctacgacccggcatacgcgggctctgttctctacggctggtgcactcgcgc 
ctttggaaagaggaacaccgtctggctgtatggacccgcgaccaccggaaagaccaacatcgcggaagccatct 
ctcacaccgtgcccttttatggctgtgtgaactggactaatgagaactttccctttaatgactgtgtggaaaaa 
atgttgatctggtgggaggagggaaagatgaccagcaaggtggtggaacccgccaaggccatcttgggggggtc 
tagagtacgagtggatcaaaaatgtaaatcctctgtacaagtagactctaccccggtgattatcacctccaata 
ctaacatgtgtgtggtggtggatgggaactccacgacctttgaacaccagcagccgctggaagaccgcatgttc 
agatttgaactcatgcggcggctcccgccagattttggcaagattaccaagcaggaagtcaaagacttttttgc 
ttgggcaaaggtcaaccaggtgccggtgactcacgagtttatggttcccaagaaagtggcgggaactgagaggg 
cggagacttctagaaaacgcccactggatgacgtcaccaataccaactataaaagtccggagaagcgggcccgg 
ctctcagttgttcctgaga-cgcctcgcagttcagacgtgcctgtagagcccgctcctctgcgacctctcaactg 
gtcttccaggtatgaatgcagatgtgactatcatgctaaatttgactctgtaacgggggaatgtgacgagtgtg 
aatatttgaatcggggcaaaaatggctgtatctttcataatgctacacattgtcaaatttgtcacgctgttcct 
ccatgggaaaaggaaaatgtgtcagattttaatgattttgatgactgtaataaagagcagtaa 



SEQ ID NO: 51 
BAAV REP 5 2 

MALVSWLVEHGITSEKQWIQENQESYLSFNSTGNSRSQIKAALDNASKIMSLTKSASDYLVGQTVPEDISENRI 

WQIFDLNGYDPAYAGSVLYGWCTRAFGKRNTVWLYGPATTGKTNIAEAISHTVPFYGCVNWTNENFPFNDCVEK 

MLIWWEEGKMTSKWEPAKAILGGSRVRVDQKCKSSVQVDSTPVIITSNTNMCVVVDGNSTTFEHQQPLEDRM 

RFELMRRLPPDFGKITKQEVKDFFAWAKVNQVPVTHEFMVPKKVAGTERAETSRKRPLDDVTNTNYKSPEKIU^R 

LSWPETPRSSDVPVEPAPLRPLNWSSRYECRCDYHAKFDSVTGECDECEYLNRGKNGCIFHNATHCQICHAVP 

P WE KENVS DFND FDDCNKEQ * 



SEQ ID NO: 52 
BAAV vpl ORF 

atgtcttttgttgaccaccctccagattggttggaatcgatcggcgacggctttcgtgaatttctcggccttga 
ggcgggtcccccgaaacccaaggccaatcaacagaagcaagataacgctcgaggtcttgtgcttcctgggtaca 
agtatcttggtcctgggaacggccttgataagggcgatcctgtcaattttgctgacgaggttgcccgagagcac 
gacctctcctaccagaaacagcttgaggcgggcgataacccttacctcaagtacaaccacgcggacgcagagtt 
tcaggagaaactcgcttctgacacttcttttgggggaaaccttgggaaggctgttttccaggctaaaaagagga 
ttctcgaacctcttggcctggttgagacgccggataaaacggcgcctgcggcaaaaaagaggcctctagagcag 
agtcctcaagagccagactcctcgagcggagttggcaagaaaggcaaacagcctgccagaaagagactcaactt 
tgacgacgaacctggagccggagacgggcctcccccagaaggaccatcttccggagctatgtctactgagactg 
aaatgcgtgcagcagctggcggaaatggtggcgatgcgggacaaggtgccgagggagtgggtaatgcctccggt 
gattggcattgcgattccacttggtcagagagccacgtcaccaccacctcaacccgcacctgggtcctgccgac 
ctacaacaaccacctgtacctgcggctcggctcgagcaacgccagcgacaccttcaacggattctccaccccct 
ggggatactttgactttaaccgcttccactgccacttctcgccaagagactggcaaaggctcatcaacaaccac 
tggggactgcgccccaaaagcatgcaagtccgcatcttcaacatccaagttaaggaggtcacgacgtctaacgg 
ggagacgaccgtatccaacaacctcaccagcacggtccagatctttgcggacagcacgtacgagctcccgtacg 
tgatggatgcaggtcaggagggcagcttgcctcctttccccaacgacgtgttcatggtgcctcagtacgggtac 
tgcggactggtaaccggaggcagctctcaaaaccagacagacagaaatgccttctactgtctggagtactttcc 
cagccagatgctgagaaccggaaacaactttgagatggtgtacaagtttgaaaacgtgcccttccactccatgt 
acgctcacagccagagcctggataggctgatgaacccgctgctggaccagtacctgtgggagctccagtctacc 
acctctggaggaactctcaaccagggcaattcagccaccaactttgccaagctgaccaaaacaaacttttctgg 
ctaccgcaaaaactggctcccggggcccatgatgaagcagcagagattctccaagactgccagtcaaaactaca 
agattccccagggaagaaacaacagtctgctccattatgagaccagaactaccctcgacggaagatggagcaat 
tttgccccgggaacggccatggcaaccgcagccaacgacgccaccgacttctctcaggcccagctcatctttgc 
ggggcccaacatcaccggcaacaccaccacagatgccaataacctgatgttcacttcagaagatgaacttaggg 
ccaccaacccccgggacactgacctgtttggccacctggcaaccaaccagcaaaacgccaccaccgttcctacc 
gtagacgacgtggacggagtcggcgtgtacccgggaatggtgtggcaggacagagacatttactaccaagggcc 
catttgggccaaaattccacacacggatggacactttcacccgtctcctctcattggcggatttggactgaaaa 
gcccgcctccacaaatattcatcaa^aacarer^^ 

atcaacagcttcatcacccagtacagcaccggacaggtggctgtcaaaatagaatgggaaatccagaaggagcg 
gtccaagagatggaacccagaggtccagttcacgtccaactacggagcacaggactcgcttctctgggctcccg 
acaacgccggagcctacaaagagcccagggccatt-ggatcccgatacctcaccaaccacctctag 



SEQ ZD NO: 53 
BAAV Vpl 
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MSFVDHPPDWLESIGDGFREFLGIiEAGPPKPKANQQKQDNARGLVLPGYKYLGPGNGLDKGDPVNFADEVAREH 
DLSYQKQLEAGDNPYIiKYNHADAEFQEKLASDTSFGGNLGKAVFQAKKRILEPLGLVETPDKTAPAAKKRPLEQ 
S PQEPDS S SG VGKKGKQPARKRLNFDDEPGAGDGPPPEGPS SGAMSTETEMRAAAGGNGGDAGQGAEGVGNASG 
DWHCDSTWSESHVTTTSTRTWVLPTYNiraLYLRLGSSNASDTFNGFSTPWGYFDFNRFHCHFSPRDWQRIilNNH 
WGLRPKSMQVRIFNIQVKEVTTSNGETTVSNNLTSTVQIFADSTYELPYVMDAGQEGSLPPFPNDVFMVPQYGY 
CGLVTGGSSQNQTDRNAFYCLEYFPSQMLRTGl^FEMVYKFEl^PFHSMYAHSQSLDRIiMNPL,LDQYLWELQST 
TSGGTLNQGNSATNFAKLTKTNFSGYRKNWLPGPMMKQQRFSKTASQNYKIPQGRNNSLLHYETRTTLDGRWSN 

F APGTAMATAAND ATD FS QAQL I FAG PN I TGNTTTD ANNLMFTS EDE LRATNPRDTDL FGHLATNQQN ATTVPT 
WDVDGVGVYPGMVWQDRDIYYQGPIWAKIPHTDGHFHPSPLIGGFGLKSPPPQIFIKNTPVPANPATTFSPAR 
INSFITQYSTGQVAVKIEWEIQKERSKRWNPEVQFTSNYGAQDSLLWAPDNAGAYKEPRAIGSRYLTNHL* 



SEQ ID NO: 54 
BAAV vp2 ORF 

acggcgcctgcggcaaaaaagaggcctctagagcagagtcctcaagagccagactcctcgagcggagttggcaa 
gaaaggcaaacagcctgccagaaagagactcaactttgacgacgaacctggagccggagacgggcctcccccag 
aaggaccatcttccggagctatgtctactgagactgaaatgcgtgcagcagctggcggaaatggtggcgatgcg 
ggacaaggtgccgagggagtgggtaatgcctccggtgattggcattgcgattccacttggtcagagagccacgt 
caccaccacctcaacccgcacctgggtcctgccgacctacaacaaccacctgtacctgcggctcggctcgagca 
acgccagcgacaccttcaacggattctccaccccctggggatactttgactttaaccgcttccactgccacttc 
tcgccaagagactggcaaaggctcatcaacaaccactggggactgcgccccaaaagcatgcaagtccgcatctt 
caacatccaagttaaggaggtcacgacgtctaacggggagacgaccgtatccaacaacctcaccagcacggtcc 
agatctttgcggacagcacgtacgagctcccgtacgtgatggatgcaggtcaggagggcagcttgcctcctttc 
cccaacgacgtgttcatggtgcctcagtacgggtactgcggactggtaaccggaggcagctctcaaaaccagac 
agacagaaatgccttctactgtctggagtactttcccagccagatgctgagaaccggaaacaactttgagatgg 
tgtacaagtttgaaaacgtgcccttccactccatgtacgctcacagccagagcctggataggctgatgaacccg 
ctgctggaccagtacctgtgggagctccagtctaccacctctggaggaactctcaaccagggcaattcagccac 
caactttgccaagctgaccaaaacaaacttttctggctaccgcaaaaactggctcccggggcccatgatgaagc 
agcagagattctccaagactgccagtcaaaactacaagattccccagggaagaaacaacagtctgctccattat 
gagaccagaactaccctcgacggaagatggagcaattttgccccgggaacggccatggcaaccgcagccaacga 
cgccaccgacttctctcaggcccagctcatctttgcggggcccaacatcaccggcaacaccaccacagatgcca 
ataacctgatgttcacttcagaagatgaacttagggccaccaacccccgggacactgacctgtttggccacctg 
gcaaccaaccagcaaaacgccaccaccgttcctaccgtagacgacgtggacggagtcggcgtgtacccgggaat 
ggtgtggcaggacagagacatttactaccaagggcccatttgggccaaaattccacacacggatggacactttc 
acccgtctcctctcattggcggatttggactgaaaagcccgcctccacaaatattcatcaaaaacactcctgta 
cccgccaatcccgcaacgaccttctctccggccagaatcaacagcttcatcacccagtacagcaccggacaggt 
ggctgtcaaaatagaatgggaaatccagaaggagcggtccaagagatggaacccagaggtccagttcacgtcca 
actacggagcacaggactcgcttctctgggctcccgacaacgccggagcctacaaagagcccagggccattgga 
tcccgatacctcaccaaccacctctag 



SEQ ID NO: 55 
BAAV Vp2 

TAPAAKKRPLEQSPQEPDSSSGVGKKGKQPARKRLNFDDEPGAGDGPPPEGPSSGAMSTETEMRAAAGGNGGDA 
GQGAEGVGNASGDWHCDSTWSESHVTTTSTRTWVLPTYNNHLYLRLGSSNASDTFNGFSTPWGYFDFNRFHCHF 
SPRDWQRDINNHWGLRPKSMQVRIFNIQVKEVTTSNGETTVSNNLTSTVQIFADSTYELPYVMDAGQEGSLPPF 
PNDVFMVPQYGYCGLVTGGSSQNQTDRNAFYCLEYFPSQMLRTGNNFEMVYKFENVPFHSMYAHSQSLDRLMNP 
LLDQYLWEL.QSTTSGGTLNQGNSATOFAKLTKTNFSGYRKNWLPGPMMKQQRFSKTASQNYKIPQGRNNSIiL.HY 
ETRTTLDGRWSNFAPGTAMATAANDATDFSQAQLIFAGPNITGNTTTD ANNLMFTS EDELRATNPRDTDLFGHL 
ATNQQNATTVPTVDDVDGVGVYPGMVWQDRDIYYQGPIWAKIPHTDGHFHPSPLIGGFGLKSPPPQIFIKNTPV 
PANPATTFSPARINSFITQYSTGQVAVKIEWEIQKERSKRWNPEVQFTSNYGAQDSLLWAPDNAGAYKEPRAIG 

SRYLTNHL 



SEQ ID NO: 56 
BAAV vp3 ORF 

atgcgtgcagcagctggcggaaatggtggcgatgcgggacaaggtgccgagggagtgggtaatgcctccggtga 
ttggcattgcgattccacttggtcagagagccacgtcaccaccacctcaacccgcacctgggtcctgccgacct 
acaacaaccacctgtacctgcggctcggctcgagcaacgccagcgacaccttcaacggattctccaccccctgg 
ggatactttgactttaaccgcttccactgccacttctcgccaagagactggcaaaggctcatcaacaaccactg 
gggactgcgccccaaaagcatgcaagtccgcatcttcaacatccaagttaaggaggtcacgacgtctaacgggg 
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agacgaccgtatccaacaacctcaccagcacggtccagatctttgcggacagcacgtacgagctcccgtacgtg 
atggatgcaggtcaggagggcagcttgcctcctttccccaacgacgtgttcatggtgcctcagtacgggtactg 
cggactggtaaccggaggcagctctcaaaaccagacagacagaaatgccttctactgtctggagtactttccca 
gccagatgctgagaaccggaaacaactttgagatggtgtacaagtttgaaaacgtgcccttccactccatgtac 
gctcacagccagagcctggataggctgatgaacccgctgctggaccagtacctgtgggagctccagtctaccac 
ctctggaggaactctcaaccagggcaattcagccaccaactttgccaagctgaccaaaacaaacttttctggct 
accgcaaaaactggctcccggggcccatgatgaagcagcagagattctccaagactgccagtcaaaactacaag 
attccccagggaagaaacaacagtctgctccattatgagaccagaactaccctcgacggaagatggagcaattt 
tgccccgggaacggccatggcaaccgcagccaacgacgccaccgacttctctcaggcccagctcatctttgcgg 
ggcccaacatcaccggcaacaccaccacagatgccaataacctgatgttcacttcagaagatgaacttagggcc 
accaacccccgggacactgacctgtttggccacctggcaaccaaccagcaaaacgccaccaccgttcctaccgt 
agacgacgtggacggagtcggcgtgtacccgggaatggtgtggcaggacagagacatttactaccaagggccca 
tttgggccaaaattccacacacggatggacactttcacccgtctcctctcattggcggatttggactgaaaagc 
ccgcctccacaaatattcatcaaaaacactcctgtacccgccaatcccgcaacgaccttctctccggccagaat 
caacagcttcatcacccagtacagcaccggacaggtggctgtcaaaatagaatgggaaatccagaaggagcggt 
ccaagagatggaacccagaggtccagttcacgtccaactacggagcacaggactcgcttctctgggctcccgac 
aacgccggagcctacaaagagcccagggccattggatcccgatacctcaccaaccacctctag 



SEQ ID NO: 57 
BAAV Vp3 

MRAAAGGNGGDAGQGAEGVGNASGDWHCDSTWSESHVTTTSTRTWVLPTYNNHLYLRLGSSNASDTFNGFSTPW 
GYFDFNRFHCHFSPRDWQRLINNHWGLRPKSMQVRIFNIQVKEVTTSNGETTVSNNLTSTVQIFADSTYELPYV 
MDAGQEGSLPPFPNDVFMVPQYGYCGLVTGGSSQNQTDRNAFYCIiEYFPSQMLiRTGNNFEWVrYKFENVPFHSMY 
AHSQSLDRLMNPLLDQYLWELQSTTSGGTIjNQGNSATNFAKLTKTNFSGYRKNWLPGPMMKQQRFSKTASQNYK 
IPQGRNNSLLHYETRTTLDGRWSNFAPGTAMATAANDATDFSQAQLIFAGPNITGNTTTDANNLMFTSEDELRA 
TNPRDTDLFGHLATNQQNATTVPT VDDVDGVGVY PGMVWQDRD I YYQGP I WAKI PHTDGHFHP S PL I GGFGL KS 
PPPQIFIKNTPVPANPATTFSPARINSFITQYSTGQVAVKIEWEIQKERSKRWNPEVQFTSNYGAQDSLLWAPD 

NAGAYKEPRAIGSRYLTNHL* 



SEQ ID NO: 58 
ITR 

gtggcactcccccccctgtcgcgttcgctcgttcgctggctcgattgggggggtggcagctcaaagagctgcca 
gacgacggccctctgggccgtcgcccccccaatcgagccagcgaacgagcgaacgcgacaggggggggagtgcc 

ac 

SEQ ID NO i 59 
D-region 

ctctagcaagggggttttgt 



SEQ ID NO: 60 
TRS 

agtgtgg 



SEQ ID NO: 61 
BAAV p5 promoter 

aggtggtgatgtcattgttgatgtcattatagttgtcacgcgatagttaatgattaacagtcatgtgatgtgtg 
ttatccaataggatgaaagcgcgcgaatgagatctcgcgagacttccggggtataaaaggggtgagtgaacgag 

cccgccgcca 

SEQ ID NO: 62 
BAAV pi 9 promoter 

ggtggattctgggtatattcccgcctacctgctgccgaaggtccaaccagagcttcagtgggcgtggactaacc 
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tcgaagagtataaattggccgccctcaatctggaggag 

SEQ ID NO: 63 
BAAV p4 0 promoter 

agtcaaagacttttttgcttgggcaaaggtcaaccaggtgccggtgactcacgagtttatggttcccaagaaag 
tggcgggaactgagagggcggagacttctagaaaacgcccactggatgacgtcaccaataccaactataaaagt 
ccggagaagcgggcccggctc 
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